=================================================== ** ** NEPS STARTING COHORT 4 - RELEASE NOTES a.k.a CHANGE LOG ** changes and updates for release NEPS SC4 6.0.0 ** (doi:10.5157/NEPS:SC4:6.0.0) ** =================================================== * Known Issues * - an updated data manual is not yet published; please use the appropriate data manual from version 1.0.0 instead - CohortProfile contains the variable tx80232 Individual tracking: Type of school (PAPI), which is victim of a severe coding error, please use the following piece of sophisticated Stata syntax to resolve the problem: recode tx80232 (8=10) (7=9) (6=8) (5=7) (4=6) (3=5) (2=4) (1=3) =================================================== * Changes introduced to NEPS:SC4 by version 6.0.0 * =================================================== General: - starting with this release, all NEPS Scientific Use Files will ship with an additional, unicode-enabled Stata data set version; this version is only readable in Stata version 14 or younger, and is placed in the subdirectory "Stata14" - translation for all meta data (variable and value labels, question texts, etc) have been revised and completed - meta data for all variables have been revised and updated where appropriate - additional waves 5 and 6 have been incorporated into the data CohortProfile: - variable "State of participation/attrition" [tx80220] erroneously did not reflect attrition of finally dropped out cases in version 4.0.0, but they were categorized as "temporary drop out"; this has been fixed pCourseClass: - linkage recommendation variable [ex20100] in had been erroneously calculated across waves, not inside each wave, in version 4.0.0; this has been fixed pCourseGerman: - linkage recommendation variable [ex20100] in had been erroneously calculated across waves, not inside each wave, in version 4.0.0; this has been fixed pCourseMath: - linkage recommendation variable [ex20100] in had been erroneously calculated across waves, not inside each wave, in version 4.0.0; this has been fixed pEducator: - due to a merging error, version 4.0.0 of teh dataset did not contain information about educators from wave 1; this has been fixed pInstitution: - version 4.0.0 contained a wrong value and variable label of variable [h228002], which is erroneously equal to labels of variable [h228000]; the correct variable label is: "School: number of schools of the same type in the vicinity"; this has been fixed pTarget: - the filename of the dataset "pTargetPAPI" has been changed to "pTarget", as it now contents additional information from the CAWI interview in wave 5 - please refer to the field and methods report online for details on the wave 5 CAWI survey - variables concerning expected income [t513011-t513020] in the "pTargetPAPI" data set in version 4.0.0 have been erroneously renamed to match variable names from questions in newer waves, but with different wording; this has been fixed - PAPI variables containing the result of automated multi-answer coding suffered from an error in the recoding routine for version 4.0.0 and were erroneously only filled with missing values; this has been fixed variables affected from this issue are: "Mother tongue" [t41000a] in the "pTargetPAPI" data set "Second language" [t410010] in the "pTargetPAPI" data set "Nationality" [t40115a] in the "pTargetPAPI" data set "Mother: Mother tongue" [t41010a] in the "pTargetPAPI" data set "Father: Mother tongue" [t41012a] in the "pTargetPAPI" data set - a set of PAPI variables suffered from a meta data rename error in version 4.0.0 and contained erroneous labels; this has been fixed variables affected from this issue are (complete set of variables including unaffected variables): "Items at home: desk" [t34006a] in the "pTargetPAPI" data set "Items at home: room" [t34006b] in the "pTargetPAPI" data set "Items at home: education software" [t34006c] in the "pTargetPAPI" data set "Items at home: classic literature" [t34006d] in the "pTargetPAPI" data set "Items at home: poetry books" [t34006e] in the "pTargetPAPI" data set "Items at home: art (paintings)" [t34006f] in the "pTargetPAPI" data set "Items at home: books for homework" [t34006g] in the "pTargetPAPI" data set "Items at home: dictionary" [t34006h] in the "pTargetPAPI" data set pTargetCATI: - when generating variable "Global self-esteem" [t66003a_g1] in the pTargetCATI data set for version 4.0.0, variable "Global self-esteem: competence" [t66003d] erroneously had been ignored; this has been fixed; t66003a_g1 can be re-generated in version 4.0.0 data using the following Stata syntax: * -------------------------BEGIN Stata------------------------------- nepsmiss t66003a t66003b t66003c t66003d t66003e t66003f t66003g t66003h t66003i t66003j tempvar t66003b_r t66003e_r t66003f_r t66003h_r t66003i_r rowmissings recode t66003b (1=5) (2=4) (3=3) (4=2) (5=1), generate(`t66003b_r') recode t66003e (1=5) (2=4) (3=3) (4=2) (5=1), generate(`t66003e_r') recode t66003f (1=5) (2=4) (3=3) (4=2) (5=1), generate(`t66003f_r') recode t66003h (1=5) (2=4) (3=3) (4=2) (5=1), generate(`t66003h_r') recode t66003i (1=5) (2=4) (3=3) (4=2) (5=1), generate(`t66003i_r') egen `rowmissings'=rowmiss(t66003a `t66003b_r' t66003c t66003d /// `t66003e_r' `t66003f_r' t66003g `t66003h_r' `t66003i_r' t66003j) egen `target_variable'=rowtotal(t66003a `t66003b_r' t66003c t66003d /// `t66003e_r' `t66003f_r' t66003g `t66003h_r' `t66003i_r' t66003j) if `rowmissings'==0 & wave==3 replace `target_variable'=-54 if wave!=3 label variable `target_variable' "Global self-esteem" replace `target_variable'=-55 if missing(`target_variable') * ---------------------------END Stata-------------------------------- - SUF version SC4-4.0.0 contained the variable [tf11153] it has been renamed to [tf11154] in SUF version SC4-6.0.0 - SUF version SC4-4.0.0 contained the variable [t31035a_v1] it has been renamed to [t31035a] in SUF version SC4-6.0.0 - SUF version SC4-4.0.0 contained the variable [t31035a] it has been renamed to [t31035b] in SUF version SC4-6.0.0 - SUF version SC4-4.0.0 contained the variable [t31135a_v1] it has been renamed to [t31135a] in SUF version SC4-6.0.0 - SUF version SC4-4.0.0 contained the variable [t31135a] it has been renamed to [t31135b] in SUF version SC4-6.0.0 pParent: - as wave 5 data makes this a panel dataset, the filename has changed from "xParent" to "pParent" xTargetCompetencies: - competency data for wave 3 test "English as a foreign language" could not be delivered in due time for version 4.0.0 by the responsible data editors; integration into the "xTargetCompetencies" dataset followed with the version 6.0.0 release; this also slightly affects "State of participation/attrition" [tx80220] if some respondents participated in testing, but not in the survey Biography: - additional spells of type "data edition gap" have been inserted to fill gaps between (a) the eighth birth day and the first reported episode and (b) the most recently reported episode and the most recent interview date =================================================== * Changes introduced to NEPS:SC4 by version 4.0.0 * =================================================== General: - SPSS data sets now ship with the same "VARIABLE ATTRIBUTES" as Stata data sets' "characteristics" - metadata for all datasets has been revised and updated where appropriate - variables now ship with a characteristic 'NEPS_instname' attached in Stata datasets, reporting the variable name used in the survey - wave 3 and 4 data has been fully integrated into the data - several bugfixes and enhancements have been integrated into this new release, influencing various variables; only the most important ones are listed in this change log - meta data in all data sets have been revised and updated where appropriate - data from the first two surveys of persons leaving the school system for vocational education have been added (waves 3 and 4) - data from the third survey in the school system have been added (wave 3) CohortProfile: - a variable indicating the current type and track of school per class has been generated (t723080_g1) pTargetPAPI: - as wave 3 data makes this a panel dataset, the filename has changed from "xTarget" to "pTargetPAPI" - variables indicating migrational background (t400500_g1 through _g3) have been added spParentSchool: - as from wave 3 on there are several sources for biography episodes, those originating from the parent interview are prefixed by "Parent", changing the filename from "spSchool" to "spParentSchool" - the interview process in parent's interviews does not guarantee unique ids for parents; thus, the identifier in this dataset is no longer "ID_p", but the target person's "ID_t" spParentGap: - as from wave 3 on there are several sources for biography episodes, those originating from the parent interview are prefixed by "Parent", changing the filename from "spGap" to "spParentGap" - the interview process in parent's interviews does not guarantee unique ids for parents; thus, the identifier in this dataset is no longer "ID_p", but the target person's "ID_t" RepWeights: - replication weights have been transferred to a separate dataset "RepWeights" pEducator: - as wave 3 data makes this a panel dataset, the filename has changed from "xEducator" to "pEducator" pInstitution: - as wave 3 data makes this a panel dataset, the filename has changed from "xInstitution" to "pInstitution" pCourseClass: - as wave 3 data makes this a panel dataset, the filename has changed from "xCourseClass" to "pCourseClass" pCourseGerman: - as wave 3 data makes this a panel dataset, the filename has changed from "xCourseGerman" to "pCourseGerman" pCourseMath: - as wave 3 data makes this a panel dataset, the filename has changed from "xCourseMath" to "pCourseMath" =================================================== * Changes introduced to NEPS:SC4 by version 1.1.0 * =================================================== General: - metadata for all datasets has been revised and updated where appropriate CohortProfile: - variable 'Gender (plausible)' [tx29001] has been generated, reflecting all reported information from surveys of target persons in waves 1 and 2, parental interview and lists of students wave 1 and 2 (in this logical sequence, read: information given by target persons in wave 1 superseeding information given by target persons in wave 2, which is superseeding information from the parental interview and so on) - variable 'Year of birth (plausible)' [tx2900y] has been generated, reflecting all reported information from surveys of target persons in waves 1 and 2, parental interview and lists of students wave 1 and 2 (in this logical sequence, read: information given by target persons in wave 1 superseeding information given by target persons in wave 2, which is superseeding information from the parental interview and so on) - variable 'Month of birth (plausible)' [tx2900m] has been generated, reflecting all reported information from surveys of target persons in waves 1 and 2, parental interview and lists of students wave 1 and 2 (in this logical sequence, read: information given by target persons in wave 1 superseeding information given by target persons in wave 2, which is superseeding information from the parental interview and so on) - variable 'Data available: competency test subject' [tx80522] was erroneous and has been corrected, which also influences variable 'State of participation/attrition' [tx80220] - variables weight_design_std weight_design have been removed (moved to Weights, see below) xTarget: - variables 'Month of birth' [t70004m], 'Year of birth' [t70004y] and 'Gender child' [t700031] are now stored as wide variables (suffixes _w1 and _w2), reflecting the corresponding questions had been asked both in waves 1 and 2 - variable 'Gender (plausible)' [tx29001] has been added from CohortProfile (see detailed description there) - variable 'Year of birth (plausible)' [tx2900y] has been added from CohortProfile (see detailed description there) - variable 'Month of birth (plausible)' [tx2900m] has been added from CohortProfile (see detailed description there) - added string version of 'Idealistic/Realistic vocational aspirations: Preferred choice of career': [t31060a_O] [t31160a_O] - variable [DELIMITER1] has been generated, separating blocks of variables originating from basic questions and wave 1 questions - variable [DELIMITER2] has been generated, separating blocks of variables originating from wave 1 questions and wave 2 questions - ISEI-08: updated transcoding scheme implemented [tf00260_g14], [tf0029b_g14], [tf0021b_g14], [t31060a_g14], [t31160a_g14], [t731422_g14], [t731472_g14], [tf00070_g14], [tf0013b_g14] - slightly modified general transcoding scheme with minor consequence on all derived variables [*_g1 to *_g16] - recoding of language variables ([t41000a_*], [t41010a_*], [t41012a_*], [t410010_*]) has been revised and updated if necessary xParent: - ISEI-08: updated transcoding scheme implemented [p296402_g14], [p731904_g14], [p731954_g14] - slightly modified general transcoding scheme with minor consequence on all derived variables [*_g1 to *-g16] xEducator: - ISEI-08: updated transcoding scheme implemented [e537061_g14], [e537062_g14] - slightly modified general transcoding scheme with minor consequence on all derived variables [*_g1 to *-g16] xTargetCompetencies: - new variables for procedural metacognition [*_sc6] and L1-Targetpopulation [*_sc7] - 146 empty cases have been removed - datafile has been revised and missings were tagged more precisely Weights: - new datafile for weights has been generated. All weights previously stored in CohortProfile can be found here now. xTargetMicrom, xInstitutionMicrom: - new regional data (microm data) has been added