Context is King

posit::conf(2024)

Shannon Pileggi true
2024-08-13
Left hand side shows table of unlabelled data that has variable names only in the table header, overlayed with a sad monster face. Right hand side shows table of labelled data that has variable names and variable labels in the table header, overlayed with a happy monster face.

Figure 1: Screenshot of slide 24 showing the difference between viewing labelled and unlabelled data.

recording
slides

Abstract

The quality of data science insights is predicated on the practitioner’s understanding of the data. Data documentation is the key to unlocking this understanding; with minimal effort, this documentation can be natively embedded in R data frames via variable labels. Variable labels seamlessly provide valuable data context that reduces human error, fosters collaboration, and ultimately elevates the overall data analysis experience. As an avid, daily user of variable labels, I am excited to help you discover new workflows to create and leverage variable labels in R!

Citation

For attribution, please cite this work as

Pileggi (2024, Aug. 13). PIPING HOT DATA: Context is King. Retrieved from https://www.pipinghotdata.com/talks/2024-08-13-context-is-king/

BibTeX citation

@misc{pileggi2024context,
  author = {Pileggi, Shannon},
  title = {PIPING HOT DATA: Context is King},
  url = {https://www.pipinghotdata.com/talks/2024-08-13-context-is-king/},
  year = {2024}
}