Raw Data files
These are currently stored in the \System\Rawdata\14yr\ folder, and the list below refers to files and sub-directories within this folder.
The Access database has been periodically updated to keep it in the most recent Access format, to avoid software compatibility issues. Nevertheless, the essential data from the tables are also stored in exported "copies" in the form of the csv files in the Export folder.
- 14yr.accdb.
This is the Access database file (2007 format) containing aggregated and cleaned 14 Year raw data from the booklets/questionnaires and from administrative sources. (Note that it does not contain the web test data; the files of web test data are generally very large, and contain far too many columns to be accommodated in Access tables.)
This Access database is now treated as the master copy of the data (except for the web tests), and is the source of such data for the analysis dataset. The important tables in the database are:- Teacher - data from the teacher questionnaires
- Parent - data from the parent booklet
- Child - data from the child booklet
- SLQ - data from the parent SLQ questionnaire
- yr14Progress - admin data relating to the 14 Year study, such as return dates. This table was originally in the TEDS admin database.
- \Export\ subdirectory, containing 14 Year raw data files
that have been exported from the Access database. These files are directly used to construct the
analysis dataset. They are csv text files. The files are called:
- Parent.csv (parent booklet data)
- SLQ.csv (parent SLQ questionnaire data)
- TwinQuestionnaire.csv (twin booklet data)
- Teacher.csv (teacher questionnaire data)
- Admin.csv (general admin data from table yr14Progress, including booklet return dates)
- \web data files\ subdirectory,
containing aggregated web test data files. There is one file for
each of the twin web activities (Science, Vocabulary, Ravens
Matrices) plus a file containing web family status data (such as
parent consent, and start and end dates and test status for each
twin). Each such file contains data
aggregated from the two waves of
data collection, with identifying fields (like names) removed. These files
were aggregated, with some cleaning, from the raw analysis files that were
originally downloaded from the web server.
The web data files are as follows.
- 14yr_science.csv - Science test data
- 14yr_raven.csv - Ravens test data
- 14yr_vocab.csv - Vocabulary test data
- 14yr_families_status.csv - Family status data
Dataset files
These files are currently stored in the \System\Datasets\14yr\ folder. The following list refers to items within this folder.
- Ndb9456.sav - the PSS version of the 14 Year dataset
- \working files\ - this subdirectory contains various intermediate files, saved during the process of converting the raw data into the dataset. These files include working datasets n1merge, n2recode, n3derive, n4label, n5double (all .sav files), saved at the end of the 5 scripts. The latter file is identical (except for the name) to the final dataset file named above.
Syntax files (scripts)
These files are currently stored in the
\System\Scripts\14yr\ folder.
Note that these are SPSS syntax files. The names of the scripts are
N1_merge, N2_recode, N3_derive, N4_label, N5_double
(all .sps files). The
processing
carried out by these scripts is described on another page.