Raw Data files
These are currently stored in the \System\Rawdata\16yr\ folder, and the list below refers to files and sub-directories within this folder.
The Access database has been periodically updated to keep it in the most recent Access format, to avoid software compatibility issues. Nevertheless, the essential data from the tables are also stored as exported "copies" in the form of the csv files in the Export folder.
- 16yr.accdb.
This is the Access database file (2007 format) containing aggregated and cleaned 16 Year raw data from the booklets/questionnaires and from administrative sources. (Note that it does not contain the web study data; the files of web activity data are generally very large, and contain far too many columns to be accommodated in Access tables.)
This Access database is now the master copy of the data extracted from all paper questionnaires, and is the source of such data for the analysis dataset. The important tables in the database are:- Parent1, Parent2 - data from the parent Behaviour/LEAP Study booklet
- Child1, Child2, Child3 - data from the child Behaviour/LEAP Study booklet (the Child3 table contains item data from new measures that were not in the cohort 1 Behaviour study, but were added for the cohort 2/3/4 LEAP study)
- Leap2Parent1, Leap2Parent2 - data from the parent LEAP-2 Study booklet
- Leap2Child1, Leap2Child2 - data from the child LEAP-2 Study booklet
- GCSEresults, OtherResults - data from the paper GCSE form. The former table contains results for short-course GCSEs and full GCSEs, the latter table contains results from other types of qualification.
- yr16Progress - admin data relating to the 16 Year study, such as return dates.
- \Export\ subdirectory, containing exported
16 Year raw data files. These files are directly used to construct the
analysis dataset. The files are exported from the Access database described
above; they are csv files. Each exported csv files originates from a table in
the Access database above. The files are called:
- Parent1.csv, Parent2.csv (parent Behaviour/LEAP Study booklet data)
- Child1.csv, Child2.csv, Child3.csv (twin Behaviour/LEAP Study booklet data)
- Leap2Parent1.csv, Leap2Parent2.csv (parent LEAP-2 Study booklet data)
- Leap2Child1.csv, Leap2Child2.csv (twin LEAP-2 Study booklet data)
- GCSEresults.csv, OtherExamResults.csv (twin GCSE and other examination results)
- 16yrAdmin.csv (admin data from table yr16Progress, including return dates)
- \web data files\ subdirectory,
containing aggregated web test data files. There is one file for
each of the twin web activities, although the 4 questionnaire
activities have been merged into a single file. There is also a
file containing web family data, including the brief parent
questionnaire data, dates, and status flags for each twin
activity. Each file contains data
aggregated from the two waves of
data collection, with identifying fields (like names) removed. These files
were aggregated, with some cleaning, from the raw analysis files that were
originally downloaded from the web server.
The web data files are as follows.
- 16yr_families_status.csv - web study progress data for each family, plus the parent web SES questionnaire data
- 16yr_corsi_block.csv - Corsi Block test data
- 16yr_dot_number.csv - Dot Number test data
- 16yr_number_line.csv - Number Line test data
- 16yr_number_sense.csv - Number Sense (Dot Task) test data, including derived Weber fractions scores
- 16yr_PVT.csv - Problem Verification Task test data
- 16yr_reaction_times.csv - Reaction Times test data
- 16yr_understanding_number.csv - Understanding Number test data
- 16yr_raven.csv - Ravens Matrices test data
- 16yr_mill_hill.csv - Mill Hill Vocabulary test data
- 16yr_expressions.csv - Expressions (Figurative Language) test data
- 16yr_yes_no.csv - Yes/No (Reading Fluency) test data
- 16yr_passages.csv - Passages (RC-16) test data
- 16yr_qnrABCD.csv - combined data from the four twin "Environment and Wellbeing" web questionnaires: Part A, About You; Part B, Your School; Part C, You and Your Friends; Part D, More About You
Dataset files
These files are currently stored in the \System\Datasets\16yr\ folder. The following list refers to items within this folder.
- Pdb9456.sav - the SPSS version of the full 16 Year dataset
- \working files\ - this subdirectory contains various intermediate files, saved during the process of converting the raw data into the dataset. These files include working datasets p1merge, p2clean, p3derive, p4label1, p5label2, p6double (all .sav files), saved at the end of the 6 scripts. The latter file is identical (except for the name) to the final dataset Pdb9456.sav.
Syntax files (scripts)
These files are currently stored in the
\System\Scripts\16yr\ folder.
Note that these are SPSS syntax files. The names of the scripts are
P1_merge, P2_clean, P3_derive, P4_label_part1,
P5_label_part2, P6_double (all .sps files). The
processing
carried out by these scripts is described on another page.