ALGAE Protocol: The Data Dictionary

An automated protocol for assigning early life exposures to longitudinal cohort studies

ALGAE Data Dictionary

by Kevin Garwood

The ALGAE protocol produces 598 variables. Each variable name captures the context in which the variable was generated. For example, variables whose names begin with algae21 (algae2100 to algae2133) are address history variables that are part of the early life analysis. Each family of variables is described below.

Some of the tables and variables in the data dictionary appear highlighted in red. We have marked these because they may warrant special consideration by the information governance committees that regulate data sharing with cohort activities. See: Assessing Sensitive Data in the Data Dictionary



| Variable Range | Total | Description |
|---|---|---|
| algae1100-algae1106 | 7 | These variables describe the name and temporal boundaries of life stages that are used in the early life analysis. |
| algae1200-algae1206 | 7 | These variables describe the name and temporal boundaries of life stages that are used in the later life analysis. |
| algae2100-algae2133 | 34 | Early life. These variables describe the original address periods used in the early life analysis and all the changes that were made to them so they could be used in an exposure assessment. |
| algae2200-algae2233 | 34 | Later life. These variables describe the original address periods used in the later life analysis and all the changes that were made to them so they could be used in an exposure assessment. |
| algae3100-algae3157 | 58 | These early life exposure values are based on the cleaned mobility assessment method. |
| algae3200-algae3242 | 43 | These early life exposure values are based on the uncleaned mobility assessment method. |
| algae3300-algae3342 | 43 | These early life exposure values are based on the life stage mobility assessment method. |
| algae3400-algae3442 | 43 | These early life exposure values are based on the birth address assessment method. |
| algae3500-algae3557 | 58 | These later life exposure values are based on the cleaned mobility assessment method. |
| algae3600-algae3642 | 43 | These later life exposure values are based on the uncleaned mobility assessment method. |
| algae3700-algae3742 | 43 | These later life exposure values are based on the life stage mobility assessment method. |
| algae4100-algae4116 | 17 | Early life. These variables compare early life exposures generated by the cleaned mobility assessment and uncleaned mobility assessment methods. |
| algae4200-algae4216 | 17 | Early life. These variables compare early life exposures generated by the cleaned mobility assessment and life stage mobility assessment methods. |
| algae4500-algae4516 | 17 | These variables compare early life exposures generated by the uncleaned mobility assessment and life stage mobility assessment methods. |
| algae4600-algae4616 | 17 | These variables compare early life exposures generated by the cleaned mobility assessment and uncleaned mobility assessment methods. |
| algae4700-algae4716 | 17 | Later life. These variables compare later life exposures generated by the cleaned mobility assessment and uncleaned mobility assessment methods. |
| algae4800-algae4816 | 17 | Later life. These variables compare later life exposures generated by the cleaned mobility assessment and life stage mobility assessment methods. |
| algae4900-algae4916 | 17 | These variables compare later life exposures generated by the uncleaned mobility assessment and life stage mobility assessment methods. |
| algae5100-algae5104 | 5 | Early life. These variables describe administrative areas that study members occupied on the first day of each life stage in the early life analysis. |
| algae5200-algae5204 | 5 | These variables describe administrative areas that study members occupied on the day they moved to a new location, as part of the early life analysis. |
| algae5300-algae5304 | 5 | Later life. These variables describe administrative areas that study members occupied on the first day of each life stage in the later life analysis. |
| algae5400-algae5404 | 5 | These variables describe administrative areas that study members occupied on the day they moved to a new location, as part of the later life analysis. |
| algae6100-algae6117 | 18 | These variables describe how the original data sets have been cleaned and processed in the early life analysis. Researchers can use them to isolate subsets of results and estimate the effect of data cleaning activities on results. |
| algae6200-algae6204 | 5 | These variables describe aspects of cleaning and movement patterns within life stages of the early life analysis. |
| algae6300-algae6317 | 18 | These variables describe how the original data sets have been cleaned and processed in the later life analysis. Researchers can use them to isolate subsets of results and estimate the effect of data cleaning activities on results. |
| algae6400-algae6404 | 5 | These variables describe aspects of cleaning and movement patterns within life stages of the later life analysis. |
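
The variable ranges listed above lend themselves to a simple lookup. The following Python sketch is illustrative only and is not part of the ALGAE protocol. It assumes that every variable name starts with `algae` followed by a four-digit code, as the ranges in the table suggest. The `FAMILIES` list, the short "kind" labels, and the function names `family_of` and `total_variables` are hypothetical names introduced for this example.

```python
import re

# Inclusive (first_code, last_code, kind) ranges transcribed from the table.
# The "kind" labels are shorthand invented for this sketch, not ALGAE terms.
FAMILIES = [
    (1100, 1106, "life stages"),
    (1200, 1206, "life stages"),
    (2100, 2133, "address periods"),
    (2200, 2233, "address periods"),
    (3100, 3157, "exposures"),
    (3200, 3242, "exposures"),
    (3300, 3342, "exposures"),
    (3400, 3442, "exposures"),
    (3500, 3557, "exposures"),
    (3600, 3642, "exposures"),
    (3700, 3742, "exposures"),
    (4100, 4116, "method comparisons"),
    (4200, 4216, "method comparisons"),
    (4500, 4516, "method comparisons"),
    (4600, 4616, "method comparisons"),
    (4700, 4716, "method comparisons"),
    (4800, 4816, "method comparisons"),
    (4900, 4916, "method comparisons"),
    (5100, 5104, "administrative areas"),
    (5200, 5204, "administrative areas"),
    (5300, 5304, "administrative areas"),
    (5400, 5404, "administrative areas"),
    (6100, 6117, "cleaning and processing"),
    (6200, 6204, "cleaning and processing"),
    (6300, 6317, "cleaning and processing"),
    (6400, 6404, "cleaning and processing"),
]


def family_of(variable_name):
    """Return the (first, last, kind) range containing the variable, or None.

    Assumes names of the form 'algae' + four digits, optionally followed
    by further characters; raises ValueError for anything else.
    """
    match = re.match(r"algae(\d{4})", variable_name)
    if match is None:
        raise ValueError(f"not an ALGAE-style variable name: {variable_name!r}")
    code = int(match.group(1))
    for first, last, kind in FAMILIES:
        if first <= code <= last:
            return (first, last, kind)
    return None


def total_variables():
    """Count the variables covered by all ranges (both ends inclusive)."""
    return sum(last - first + 1 for first, last, _ in FAMILIES)


print(family_of("algae2105"))  # (2100, 2133, 'address periods')
print(total_variables())       # 598
```

The computed total matches the 598 variables stated in the introduction, which is a quick way to confirm that the ranges and per-family totals in the table are mutually consistent.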