In recent decades, the Big Five personality dimensions have become increasingly established as a comprehensive framework to describe personality (e.g., John et al., 2008; McCrae & Costa, 2008). This has led to broad interest in their assessment, even in fields outside core personality research, such as sociology, economics, and epidemiology. Nowadays, the Big Five are included in most large-scale social surveys as an almost standard construct, like subjective well-being.
The data resulting from these large-scale social surveys are highly valuable for personality research. The public large-scale surveys include a variety of additional constructs, they rely mostly on population-representative or at least heterogeneous samples, and often follow a longitudinal design, allowing to address key research questions of personality psychologists. Moreover, these public large-scale surveys have numerous advantages in terms of sample size and sample quality compared to the typically self-conducted small-scale studies still dominating personality psychology and adjacent fields. Self-conducted studies are usually based on small, selective samples of college students or—as it is increasingly common today—samples collected using Amazon Mechanical Turk (MTurk) (e.g., Webb & Tangney, 2022). As well-funded programs run by professionals specialized in survey research methods, these surveys typically far exceed the scope of data collections that an individual researcher or research group could ever hope to carry out alone. Many of these survey programs comprise panel data or repeated cross-sectional data that enable longitudinal analyses, greatly expanding the type of research questions that can be answered and offering opportunities for causal inference (e.g., by using fixed-effects models). These data are usually freely available to personality psychologists (and other researchers) for secondary analyses. However, apart from a few highly prominent and widely used surveys, such as the German Socio-Economic Panel (SOEP; Goebel et al., 2019), most of the large-scale surveys presented in this paper do not appear to be widely known among personality psychologists and remain underutilized in current research. This is unfortunate because these surveys have enormous analytical potential for research on the development, consequences, or predictors of personality traits—a potential that has thus far lain largely dormant. Indeed, many of these surveys are like hidden gems that have yet to be discovered by personality psychologists.
The aim of the present paper is therefore to provide researchers in personality psychology and beyond with an overview of these available and reusable data sets. Because they are so numerous, we deliberately limited this overview to surveys that (a) focused on the adult population, (b) were conducted in Germany, (c) were based on probability samples, (d) had a minimum sample size of about 1,500 respondents, and (e) included an assessment of all Big Five dimensions with a validated Big Five instrument.
Search Strategy
In the first step, we included surveys in our overview with which we were personally familiar by virtue of having worked extensively with them in the past. Second, to provide a more comprehensive and less subjective overview, we systematically searched the databases of the following German research data centers using the key words personality and Big Five: the GESIS Data Archive for the Social Sciences; the Research Data Center of the SOEP; the Research Data Centre of the German Centre for Higher Education Research and Science Studies (DZHW); the Research Data Center of the Federal Institute for Vocational Education and Training (BIBB); the Research Data Center of the Institute for Employment Research (IAB). Third, we screened well-known international survey programs for the inclusion of personality measures. Finally, we contacted experts affiliated with the identified surveys to solicit tips about additional data sources. These steps resulted in 25 data sets. Although this list may not be exhaustive, it does cover the most important and prominent large-scale surveys in Germany that include personality measures.1
Overview of the Selected Surveys
In what follows, we give an aggregated overview of the selected surveys. In this overview, we address central aspects of the study designs and the measures used to assess the Big Five. These central aspects largely define the research questions that can be posed around the Big Five. In a next step, we sketch some research potentials that emerge from further characteristics in the surveys. In addition, Table 1 provides a structured overview of the substantive focus of each survey, and Table 2 summarizes details of their designs, sample sizes, personality measures, etc.
Table 1
Survey Program | Further Specification of the Data Set | Study Focus | Exemplary Psychological Constructs | URL | Reference | Data Accessibilitya | Notes |
---|---|---|---|---|---|---|---|
ALLBUS | ISSP Germany 2003 & 2004b | Attitudes, behavior, and social change; focus on social inequality and digital divide; national identity | — | https://www.gesis.org/en/allbus/allbus-home | Terwey, 2000 | Free access upon registration | ISSP Germany 2003 & 2004 are part of the ALLBUS 2004 data set |
ALLBUS | ISSP Germany 2005 & 2006b | Attitudes, behavior, and social change; focus on attitudes toward different ethnic groups, work orientations, role of government | — | https://www.gesis.org/en/allbus/allbus-home | Terwey, 2000 | Free access upon registration | ISSP Germany 2005 & 2006 are part of the ALLBUS 2006 data set |
ALLBUS | ISSP Germany 2007 & 2008b | Attitudes, behavior, and social change; focus on political participation, leisure and sport, religion | — | https://www.gesis.org/en/allbus/allbus-home | Terwey, 2000 | Free access upon registration | ISSP Germany 2007 & 2008 are part of the ALLBUS 2008 data set |
BIBB/BAuA Employment Survey | BIBB – Personality traits and employment | Employment, working conditions, skill requirements, health | Locus of control, occupational self-efficacy, workaholism, procrastination, risk taking, vocabulary test | https://www.bibb.de/en/index.php | Rohrbach-Schmidt et al., 2020 | Free access for scientific purposes upon request | Supplementary survey to the BIBB/BAuA Employment Survey (https://metadaten.bibb.de/en/dataset/detail/123) |
DZHW PhD Panel | PhD Panel 2014 | Careers of doctorate holders | Self-efficacy, internal/external locus of control, life goals | https://www.dzhw.eu/en/index_html | G. Brandt et al., 2020 | Free access for scientific purposes upon request | |
DZHW Graduate Panel | Graduate Panel 2009 | Study, career entry, career and further qualification of university graduates | Life goals, study motives | https://www.dzhw.eu/en/forschung/projekt?pr_id=302 | G. Brandt et al., 2022 | Free access for scientific purposes upon request | |
FReDA | FReDA GGS Sample; Anchors | Dyadic multi-actor study on family life, relationships, and demography; gender roles, reproductive health | Subjective well-being, attitudes and values, satisfaction, COVID-related attitudes, relationship quality, self-esteem | https://www.freda-panel.de/FReDA/EN/Startseite.html | Schneider et al., 2021 | Free access for scientific purposes upon request | Data already collected, personality assessments to be released |
GESIS Panel | Multi-topic open panel infrastructure with yearly core modules (e.g., personality, life satisfaction) and externally submitted studies | Values, political behavior and orientations, well-being, quality of life, environmental attitudes and behavior, satisfaction, religiosity, COVID-19 behavior | https://gesis-panel.org | Bosnjak et al., 2018 | Free access for scientific purposes upon request | ||
GIP | Multi-topic panel, attitudes and preferences in political and economic decision-making processes | Satisfaction, COVID-related behavior, health, political efficacy | https://www.uni-mannheim.de/en/gip | Blom et al., 2015 | Free access for scientific purposes upon request | Subsample of about N = 3,600 participants in a COVID-related longitudinal sub-study with higher frequency | |
GLES | Cross-Section 2017, Pre- and Post-Election | Election research, political attitudes and behaviors during election campaigns and after federal elections in Germany | Emotions, trust, political knowledge, religiosity | https://gles-en.eu | GLES, 2019 | Free access upon registration | Data consist of two subsamples, surveyed before and after the election; N = 3,412 agreed to be included in the GLES Panel (Sample B), which conducts up to two waves per year |
GLES | Cross-Section 2021, Pre-Election | Election research, political attitudes and behaviors during election campaigns and after federal elections in Germany | Emotions, trust, political knowledge, religiosity | https://gles-en.eu | GLES, 2022 | Free access upon registration | Participants could agree to be included in the GLES Panel (Sample C), which conducts up to two waves per year |
SOEP | The SOEP includes several subsamples with different sociodemographic foci (not further specified in this table) | Living in Germany; household composition, occupational biographies, employment, earnings, health and satisfaction indicators | Satisfaction, self-esteem, trust, risk aversion, worries, well-being, depressive traits, life goals, locus of control, loneliness | https://www.diw.de/en/soep | Goebel et al., 2019 | Free access for scientific purposes upon request | |
ISJP Germany | ISJP 4 | Attitudes to social justice and social inequality; focus on familial social support | Satisfaction, justice attitudes | https://www.sowi.hu-berlin.de/de/lehrbereiche/empisoz/forschung/archiv/isjp | Wegener, 2015 | Free access upon registration | |
Nacaps | Nacaps 2018 | Career paths of highly qualified academics in Germany | Willingness to take risks, self-efficacy, locus of control, motives for doing a doctorate | https://www.nacaps.de/en/index_html | Adrian et al., 2020 | Free access for scientific purposes upon request | |
Nacaps | Nacaps 2020 (follow-up cohort) | Career paths of highly qualified academics in Germany | Willingness to take risks, self-efficacy, locus of control, motives for doing a doctorate | https://www.nacaps.de/en/index_html | When released, free access for scientific purposes upon request | Data already collected; to be released | |
NAKO | Starting Cohort | Multidisciplinary, population-based cohort study to investigate the development and aetiology of diseases, identify risk factors, and enhance early detection and prevention of diseases | Depression, anxiety, childhood trauma, personality traits, professional recognition, occupational stress, health-related quality of life | https://www.nako.de | Peters et al., 2022 | Free access for scientific purposes upon application | Follow-up survey completed in 2022. Follow-up examinations will be completed in 2024. Two additional surveys on COVID-19 (May/June 2020, October/November 2022). |
NEPS | Starting Cohort 5 | Educational processes and competence development | Domain‐general and domain-Specific competencies, self-esteem, vocational interests, satisfaction | https://www.neps-data.de/Mainpage | Blossfeld et al., 2011 | Free access for scientific purposes upon request | |
NEPS | Starting Cohort 6 | Educational processes and competence development | Domain‐general and domain-specific competencies, self-esteem, vocational interests, satisfaction, values | https://www.neps-data.de/Mainpage | Blossfeld et al., 2011 | Free access for scientific purposes upon request | |
Pairfam | Anchors | Multi-actor study on partnership and family dynamics, transition to parenthood, intergenerational relationships, parenting and child development | Loneliness, self-esteem, dyadic coping, depressiveness, anger, sexuality, dark triad | https://www.pairfam.de/en/ https://www.freda-panel.de/EN |
Huinink et al., 2011 | Fee-based access for scientific purposes upon request | Subsample of N = 3,160 participated in an additional, optional online survey on COVID-19. The pairfam sample was integrated into the FReDA panel in 2022. |
PASS (IAB) | Labor market and social security; dynamics between basic income support and social situations of households, poverty | Satisfaction, self-efficacy, impulsivity, memory test, benefits of work | https://iab.de/en/the-iab/surveys/panel-study-pass/ | Trappmann et al., 2019 | Free access for scientific purposes upon request | ||
Personality and Voting Behavior 2003 | Personality traits, voting behavior and political orientation, political knowledge tests. |
Values, authoritarianism, political efficacy | https://search.gesis.org/research_data/ZA4052 | Schumann, 2004 | Free access upon registration | ||
PIAAC-L | PIAAC-Longitudinal | Development of cognitive competences, longitudinal effects of skill outcomes over the life course, cognitive competencies in a broader context | Competencies, reading, numeracy, reading, reciprocity, locus of control, perseverance, satisfaction, grit, life events | https://www.gesis.org/en/piaac/rdc/data/piaac-longitudinal | Rammstedt et al., 2017 | Free access for scientific purposes upon request | |
SHAREb | Ageing processes in Europe; health, socioeconomic status, and social and family networks | Cognitive function, mental health, work quality, loneliness, trust | http://www.share-project.org/home0.html | Börsch-Supan et al., 2013 | Free access for scientific purposes upon request | ||
TwinLife | Cohort 4 | Extended twin family study on the development of social inequality; biological and social origins | Satisfaction, self-esteem, psychopathology, achievement motivation, academic self-concept, cognitive abilities | https://www.twin-life.de/documentation/ | Hahn et al., 2016; | Free access for scientific purposes upon request | Four TwinLife COVID-19 Supplementary Surveys have been conducted. |
WVSb | Sample Wave 6 | Social, political, economic, religious and cultural values, beliefs, and norms | Happiness, life satisfaction, values, religiosity, | https://www.worldvaluessurvey.org/wvs.jsp | Ingelhart et al., 2018 | Free access upon registration |
Note. ALLBUS = Allgemeine Bevölkerungsumfrage der Sozialwissenschaften [German General Social Survey]; BIBB/BAuA: BIBB = Bundesinstitut für Berufsbildung [Federal Institute for Vocational Education and Training]; BAuA= Bundesanstalt für Arbeitsschutz und Arbeitsmedizin [Federal Institute for Occupational Safety and Health]; DZHW = Deutsche Zentrum für Hochschul- und Wissenschaftsforschung [German Centre for Higher Education Research and Science Studies]; FReDA = Family Research and Demographic Analysis—the German Family Demography Panel Study; GGS = the Generations and Gender Survey; GIP = the German Internet Panel; GLES = the German Longitudinal Election Study; SOEP = the German Socio-Economic Panel; ISJP = the International Social Justice Project; ISSP = the International Social Survey Programme; Nacaps = the National Academics Panel Study; NAKO = Nationale Kohorte [German National Cohort]; NEPS = the German National Educational Panel Study; pairfam = Panel Analysis of Intimate Relationships and Family Dynamics—the German Family Panel; PASS = Panel Arbeitsmarkt und soziale Sicherung [Panel Study Labour Market and Social Security]; IAB = Institut für Arbeitsmarkt- und Berufsforschung [Institute for Employment Research (the research institute of the Federal Employment Agency]; PIAAC-L = the Programme for the International Assessment of Adult Competencies Longitudinal; SHARE = the Survey of Health, Ageing and Retirement in Europe; WVS = the World Values Survey.
a Many survey programs provide multiple data versions that differ in their levels of sensitivity and potential for de-identification. Displayed in the column is the accessibility for the standard scientific use files; data versions that include more sensitive variable might entail higher access barriers (e.g., access via on-site use or remote desktops); which, however, are not further specified in this table. b Studies are part of international survey programmes. Thus, corresponding data from other countries are also available.
Table 2
Survey Program | Further Specification of the Data Set | Target Population | N (at First Big Five Assessment) | Big Five Measure | Year of First Big Five Assessment | Longitudinal Design, Yes/No | Repeated Big Five Assessments, Yes/No | Study Span | Sampling Design | Mode (Big Five) | Big Five Reports From/About Others? |
---|---|---|---|---|---|---|---|---|---|---|---|
ALLBUS | ISSP Germany 2003 & 2004 | Age ≥ 18 years | 2,619 | BFI-10 | 2004 | No | No | Cross-sectional | Register | CAPI + SAQP | — |
ALLBUS | ISSP Germany 2005 & 2006 | Age ≥ 18 years | 3,344 | BFI-10 | 2006 | No | No | Cross-sectional | Register | CAPI + CASI | — |
ALLBUS | ISSP Germany 2007 & 2008 | Age ≥ 18 years | 3,423 | BFI-10 | 2008 | No | No | Cross-sectional | Register | CAPI + CASI | — |
BIBB/BAuA Employment Survey | BIBB-Personality traits and employment | Employed (≥ 10 hr/week) persons aged ≥ 15 years | 8,010 | BFI-S | 2017/2018 | No | No | Cross-sectional | Telephone sample, dual frame | CATI | — |
DZHW PhD panel | PhD Panel 2014 | Doctorate holders who completed their doctorate in 2014 | 5,408 | BFI-10 | 2015 | Yes | Yes, in 2018 and 2021 | 2015–ongoing | Total universe | SAQP + CAWI | — |
DZHW Graduate Panel | Graduate Panel 2009 | Higher education graduates with first degree in 2009 | 2,465 | BFI-10 | 2015 | Yes | No | 2010–2019 | Stratified cluster sample | (SAQP +) CAWI | — |
FReDA | FReDA GGS Sample; Anchors | Adults aged 18 to 49 years | 37,783a | BFI-2-XS | 2,022 | Yes | Yes, planned (cycle yet to be decided) | 2021–ongoing | Register | CAWI + SAQP | Self-reports from partners |
GESIS Panel | Age ≥ 18 years | 4,035 | BFI-10; BFI-2-S (once in 2017) | 2014 | Yes | Yes, every year | 2014–ongoing | Register | CAWI & SAQP | — | |
GIP | Adults aged 16 to 75 years | 1,483 | BFI-10 | 2012 | Yes | Yes, in 2014 (and 2018)a | 2012–ongoing | Multi-stage with random route; register-based | CAWI | Self-reports from household membersb | |
GLES | Cross-Section 2017, Pre- and Post-Election | Age ≥ 16 years | 4,291 | BFI-10 | 2017 | No | No | 2017 | Register | CAPI | — |
GLES | Cross-Section 2021, Pre-Election | Age ≥ 16 years | 5,116 | BFI-10 | 2021 | No | No | 2021 | Register | CAWI + SAQP | — |
SOEP | The SOEP includes subsamples with different sociodemographic foci | Age ≥ 16 years | 21,105 (2005) | BFI-S/BFI-S (+1 Openness item since 2009) | 2005 | Yes | Yes, every four years | 1984–ongoing | Several designs: multi-stage with random route; register-based | PAPI + CAPI | Self-reports from household members |
ISJP Germany | ISJP 4 | Adults aged 18–85 years | 3059 | BFI-S | 2006 | No | No | No | Register | PAPI | — |
NACAPS | Nacaps 2018 | Doctoral candidates and doctorate holders | 28,368 | BFI-S | 2019 | Yes | Yes, every year | 2019–ongoing | Total universe: doctoral candidates enrolled on December 1, 2018 | CAWI | — |
NACAPS | Nacaps 2020 (follow-up cohort) | Doctoral candidates and doctorate holders | 15,472 | BFI-S | 2021 | Yes | Yes, every year | 2021–ongoing | Total universe: doctoral candidates enrolled for the first time between December 2, 2018 and December 1, 2020 | CAWI | — |
NAKO | Starting Cohort | Adults aged 20–74 years | 205,415 | BFI-S | 2014-2019 | Yes | Yes, 2019–2024 | 2014–ongoing | Register | CASI | — |
NEPS | Starting Cohort 5 | First‐year students | 13,233 | BFI-10 (+ 1 Agreeableness item) | 2012 | Yes | Yes, in 2016 | 2010–ongoing | Multi-stage sampling | CATI | — |
NEPS | Starting Cohort 6 | Adults aged 21–63 years | 11,707 | BFI-10 (+ 1 Agreeableness item) | 2012/13 | Yes | Yes, in 2015/6 | 2007–ongoing | Register | CAPI + CATI | — |
Pairfam | Anchors | Adults aged between 16–18, 26–28 and 36–38 (and their partners, children, parents) | 9,069 | BFI-K | 2009/10 | Yes | Yes, every four years | 2008–ongoing | Register | CASI | Other-reports about own child (age <= 8 years) and partner's ideal personality; self-reports of partner, parents and children (age 8–14 years) |
PASS (IAB) | Age ≥ 15 years; in households registered in Germany, and in households in receipt of basic income support | 15,607 | BFI-K | 2011 | Yes | Yes, in 2019 | 2007–ongoing | Register (of welfare recipients); register from a private data provider | CAPI + CATI | Self-reports from household members | |
Personality and Voting Behavior 2003 | Age ≥ 18 years, eligible to vote | 2,544 | BFI-K; NEO-FFI; BFI | 2003 | No | No | Cross-sectional | Multi-stage with random route | PAPI + SAQP | Assessment of the personality of politicians | |
PIAAC-L | PIAAC-Longitudinal | PIAAC 2012 respondents aged 16–65 years and their household members | 6,231 | BFI-S (+ 1 Openness item) | 2,014 | Yes | Yes, in 2016 | 2012–2016 | Register | CAPI | Self-reports from partners |
SHARE | Age ≥ 50 years and their partners | 3,820 | BFI-10 (+ 1 Agreeableness item) | 2,017 | Yes | Yes, in 2019/20 | 2004–ongoing | Register | CAPI | Self-reports from partners; self-reports from the interviewers | |
TwinLife | Cohort 4 c | Twins aged ~23 years and their families |
3,792 (in 984 families with twins) c | BFI-S (+ 1 Openness item) | 2014–2016 | Yes | Yes, again in 2018–2020; and thereafter every two years | 2014–ongoing | Register | CASI | Self-reports from twins, siblings, partners, parents (other-reports only for younger cohorts) |
WVS | Sample Wave 6 | Age ≥ 18 years | 2,046 | BFI-10 | 2013 | No | No | Cross-sectional | Register | CAPI | - |
Note. ALLBUS = Allgemeine Bevölkerungsumfrage der Sozialwissenschaften [German General Social Survey]; BIBB/BAuA: BIBB = Bundesinstitut für Berufsbildung [Federal Institute for Vocational Education and Training]; BAuA= Bundesanstalt für Arbeitsschutz und Arbeitsmedizin [Federal Institute for Occupational Safety and Health]; DZHW = Deutsche Zentrum für Hochschul- und Wissenschaftsforschung [German Centre for Higher Education Research and Science Studies]; FReDA = Family Research and Demographic Analysis—The German Family Demography Panel Study; GGS = Generations and Gender Survey; GIP = the German Internet Panel; GLES = the German Longitudinal Election Study; SOEP = the German Socio-Economic Panel; ISJP = the International Social Justice Project; NACAPS = the National Academics Panel Study; NAKO = Nationale Kohorte [German National Cohort]; NEPS = the National Educational Panel Study; pairfam = the Panel Analysis of Intimate Relationships and Family Dynamics; PASS = Panel Arbeitsmarkt und soziale Sicherung [Panel Study Labour Market and Social Security]; IAB = Institut für Arbeitsmarkt- und Berufsforschung [Institute for Employment Research, the research institute of the Federal Employment Agency]; PIAAC-L = Programme for the International Assessment of Adult Competencies Longitudinal; SHARE = the Survey of Health, Ageing and Retirement in Europe; WVS = the World Values Survey. BFI-10 = the Ten-Item Big Five Inventory; BFI-S = the Big Five Inventory-SOEP; BFI-2-XS = the extra-short form of the Big Five Inventory–2 (BFI-2); BFI-2-S = the short form of the Big Five Inventory–2; BFI-K = the German-language short form of the Big Five Inventory (BFI); NEO-FFI = the NEO Five-Factor Inventory. CAWI = computer-assisted web interview; CAPI = computer-assisted personal interview; CASI = computer-assisted self-interview; CATI = computer-assisted telephone interview; SAQP = self-administered questionnaire, paper; PAPI = paper-and-pencil interview.
a The depicted N for FReDA is from the recruitment wave. Personality was (first) assessed in 2022, in wave 2B. The data of wave 2B is planned to be released in 2024. b The German Internet Panel (GIP) includes three cohorts, initially sampled in 2012, 2014, and 2018. The 2012 and 2014 cohorts are household samples; the 2018 cohort is a person sample. In 2018, the BFI-10 was administered only to the newly recruited participants. c TwinLife includes three additional cohorts that comprise families with twins aged 5 years (Cohort 1), families with twins aged 11 years (Cohort 2), and families with twins aged 17 years (Cohort 3). On the first measurement occasion, the total N was 14,413.
Research Design
Only a few of the identified survey programs are cross-sectional or repeated cross-sections in which independent samples are drawn for each wave (e.g., the German General Social Survey [ALLBUS], the World Values Survey [WVS]), whereas most of them are panel surveys following the same respondents over many years. In the period covered by the present paper, some of these panel surveys assessed the Big Five multiple times. For example, the Panel Analysis of Intimate Relationships and Family Dynamics (pairfam) and the SOEP reassessed the Big Five every four years, and the GESIS Panel did so yearly.
Sampling Design and Target Population
As noted earlier, we included only surveys that targeted either the general adult population or adult subpopulations in Germany (with a minimum age of 15 years, but with the majority of the target population aged 18 years or over). These adult subpopulations included, for example, the elderly (the Survey of Health, Ageing and Retirement in Europe [SHARE]), (un-)employed persons (the Panel Study Labour Market and Social Security [PASS]; the BIBB/BAuA Employment Survey), and highly educated persons (the DZHW Graduate Panel and PhD Panel). Further, we included only large-scale surveys comprising at least 1,500 respondents. We also restricted our selection to surveys based on randomly selected respondents, so that they were representative of the corresponding population. Compared with non-random samples, such as convenience samples or quota samples, such random samples have important advantages from the perspective of representativeness and correct statistical inference to the population level (Lohr, 2021). Random sampling in these surveys was done via random-route procedures developed by the Arbeitskreis Deutscher Markt- und Sozialforschungsinstitute (ADM; e.g., the survey Personality and Voting Behavior 2003), was register-based (e.g., ALLBUS, the GESIS Panel), or was a combination of both (e.g., the SOEP, the German Internet Panel [GIP]). For telephone surveys, established random digit dialing procedures for dual frame samples (e.g., Gabler et al., 2012) were used. In some cases, total universe samples of the specific target group were drawn (e.g., the DZHW PhD Panel).
Most of the longitudinal surveys included in our overview were regularly refreshed with new samples in order to address panel mortality. This ensured that their potential for longitudinal analyses was preserved despite dropout.
Thematic Foci of the Survey Programs
The thematic orientation differs greatly across the included survey programs. Most have more or less clear thematic foci. For example, pairfam focused on partnership and fertility; the German National Educational Panel Study (NEPS) on educational pathways and competence development; PASS on the labor market, poverty, and the welfare state; the BIBB/BAuA Employment Survey on qualification and working conditions; and SHARE on health and retirement. By contrast, the GESIS Panel, as an omnibus access panel, does not have a specific thematic focus, but rather includes a large variety of constructs according to the submitted modules (e.g., subjective health, environmental attitudes and behavior, attitudes toward refugees, social and political participation). All selected survey programs include a detailed assessment of sociodemographic background variables, such as education, socioeconomic status, income, and migration status.
Prompted by the COVID-19 pandemic, several surveys (e.g., GIP, the German TwinLife study, the GESIS Panel) included additional modules focusing on behavior, experiences, and attitudes during the pandemic.
Big Five Measures
The selected surveys differ in the measures used to assess personality. Nearly every survey that fulfilled our criteria used a short-scale variant of the Big Five Inventory (BFI; John et al., 2008; German adaptation by Rammstedt, 1997). The most likely reason for this is the fact that short and ultra-short forms of the BFI are available and can be used free of charge for research purposes.
Surveys such as the GESIS Panel, GIP, NEPS2, ALLBUS, SHARE,1 and the WVS used the BFI-10 (Rammstedt & John, 2007), which assesses each Big Five dimension with one positively keyed and one negatively keyed item, thereby implicitly controlling for acquiescence. Other surveys included more comprehensive measures. For instance, the SOEP, TwinLife, the German National Academics Panel Study (Nacaps), and the BIBB/BAuA Employment Survey used the Big Five Inventory-SOEP (BFI-S; Schupp & Gerlitz, 2008), a 15-item version of the BFI originally developed for the SOEP. Pairfam and PASS used the BFI-K (Rammstedt & John, 2005), a 20-item short form of the BFI. In one wave, the GESIS Panel also used the 30-item BFI-2-S (Rammstedt et al., 2020), a short scale version of the 60-item BFI-2 (Soto & John, 2017; German adaptation by Danner et al., 2019), a revised version of the BFI. Both the BFI-2 and the BFI-2-S allow the Big Five to be measured at both the domain and facet levels (three facets per domain). In a recent wave, the panel survey Family Research and Demographic Analysis (FReDA) used the even more abbreviated form of the BFI-2, the BFI-2-XS (Rammstedt et al., 2020).
Only one of the selected surveys—Personality and Voting Behavior 2003—included in addition to BFI scales a personality measure that did not hail from the BFI family, namely, the NEO Five-Factor Inventory (NEO-FFI; Costa & McCrae, 1989, German adaptation Borkenau & Ostendorf, 1991), the 60-item short form of the NEO Personality Inventory (NEO-PI; Costa & McCrae, 1992). The GESIS Panel and Personality and Voting Behavior 2003 included different Big Five measures, thus allowing comparisons across instruments.
Regarding the response scales used, the BFI-10, BFI-2-XS, BFI-K were always administered with a 5-point rating scale as suggested by Rammstedt and John (2005, 2007) and Rammstedt et al. (2020). The BFI-s was mostly administered using a 7-point rating scale (i.e., SOEP, Nacaps, NAKO,PIAAC-L, TwinLife). In one other study using the BFI-S (i.e., BIBB-BAuA), however, a 5-point scale was used. The NEO-FFI was also assessed using a 7-point rating scale. While response scales were generally directed from disagreement to agreement, the response scales used in the ALLBUS and ISJP were oriented in the opposite direction (i.e., from agreement to disagreement).
Additional Psychological Constructs Included in the Selected Surveys
Besides the Big Five, most of the selected survey programs also assessed other core psychological constructs, such as intelligence (e.g., BIBB-BAuA Employment Survey, NEPS, the SOEP), human values (e.g., the GESIS Panel, the WVS), or more specific constructs, such as locus of control (e.g., the BIBB-BAuA Employment Survey, the SOEP), procrastination (e.g., the BIBB-BAuA Employment Survey), and achievement motivation (e.g., NEPS).
Nearly all of the selected surveys included measures of general and/or specific satisfaction with life. Several panel surveys (e.g., the SOEP, the GESIS Panel) also measured critical life events (since the preceding survey wave).
Big Five Reports From/About Additional Respondents
All of the selected survey programs included a self-report measure of the Big Five for the target person. In some surveys, other household members (e.g., the SOEP, PASS, GIP), relatives (e.g., romantic partners in pairfam, PIAAC-L and SHARE; twins, siblings, parents, partners in TwinLife), or even the interviewer (e.g., SHARE) were asked to provide information about their own personalities. In these cases, self-reports by these household members/relatives/partners/interviewers are available and can be compared with the personality self-reports of the target person. Big Five reports about others as assessed by the target person were included in pairfam, with the target person reporting about the Big Five personality of their child and of an idealized partner. Other-reports in which a third party assesses the personality of the target person were, to the best of our knowledge, not included in any of the survey (yet, the younger cohorts of TwinLife–which did not meet our inclusion criteria–included parent-reports about the personality of the target person (i.e., the child)).
Big Five Assessment Mode
The selected survey programs differ in their assessment modes. In some cases, assessment modes even differ among respondents of the same survey according to their assessment mode preferences (e.g., web-based or paper-and-pencil questionnaire; e.g., the GESIS Panel, GIP). In other cases, assessment modes differ over time/across assessments, because in one year the assessment was conducted as a personal interview and in other years as a telephone interview or a web-based questionnaire (e.g., NEPS, the SOEP, TwinLife). In this overview, we focus on the mode(s) of the Big Five assessment (see Table 2). The Big Five were commonly assessed in the form of a personal interview (with an interviewer reading out each question and coding the answer; e.g., PASS, SHARE, NEPS Starting Cohort 6). In other surveys, Big Five questionnaires were self-administered (without an interviewer present; e.g., the GESIS Panel, TwinLife, GIP).
Further Research Potential
Besides the analysis of associations between the Big Five and various outcome variables, associations among partners/household members, and potential longitudinal effects or methodological differences among instruments, samples, and modes, the data sets also have further research potential.
Paradata
For most of the selected survey programs, some form of paradata (i.e., data describing the data collection process) are provided. These may include the assessment date, the assessment duration, regional information, information about the assessment itself (where it took place, if others were present, etc.), or information about the interviewer (i.e., the person conducting the interview and recording the answers; e.g., SHARE). Such paradata can be used for both methodological (e.g., Cheng et al., 2020) and substantive analyses (allowing, e.g., analyses of the effects of weather on personality self-reports; see Rammstedt et al., 2015).
Regional information in particular allows survey data to be merged with geodata—for example, on pollution, regional wealth, or regional political orientation—which offers wide analytical potential (e.g., Ebert et al., 2022; for a general overview, see Bluemke et al., 2017).
International Survey Programs
Although most of the survey programs included in Table 1 are national surveys conducted in Germany only, some (e.g., SHARE, the International Social Survey Programme [ISSP], the WVS, the International Social Justice Project [ISJP]) are part of international comparative survey programs. In these cases, cross-nationally comparative analyses for the Big Five are possible (e.g., Levinsky et al., 2019; Rammstedt et al., 2013; Schmitt et al., 2008).
Replication and Integrative Data Analysis
Beyond their individual value as data sources, the selected survey programs offer unprecedented analytical potential when combined to answer a specific research question. Researchers can fruitfully combine multiple data sources in different ways. For example, they can conduct independent tests of the same hypothesis in multiple data sets to ascertain whether the results replicate across studies and are robust to variations in study design, measures, sample composition, and other survey characteristics. In some cases, it may even be possible to use meta-analytical techniques to combine results obtained in separate samples. This will contribute to building a more robust and replicable body of evidence in personality psychology. Additionally, for some research questions, researchers might want to pool and harmonize several data sources in order to use them for an integrative data analysis (see Curran & Hussong, 2009; Curran et al., 2008). Among other advantages, this may be a useful way of increasing statistical power or improving the coverage of certain sociodemographic subgroups or geographical units. Such a mega-analysis was for example conducted based on ten panel studies (also including the SOEP) to investigate the prospective associations of the Big Five with several life outcomes (Beck & Jackson, 2022).
Conclusion
The present paper aimed to provide personality researchers with an overview of data-sets from large-scale surveys in Germany that include measures of the Big Five. By that we aimed to increase the awareness and interest of psychologists—usually trained in primary data assessment and usage—in reusing these available high-quality datasets as they provide a broad research potential and clear methodological advantages compared to the typically used small-scale selective samples. This potential includes, on the one hand, substantive issues, such as concurrent associations between the Big Five and a broad variety of outcome variables (e.g., Denissen et al., 2018) or personality change over time and across cohorts. Also of interest from a personality psychology point of view are associations between the Big Five and the additional psychological constructs assessed (e.g., intelligence; see, e.g., Rammstedt et al., 2016), or similarities between personality self-ratings of target persons and their partners (see, e.g., Rammstedt & Schupp, 2008) or (other) household members.
On the other hand, the available longitudinal data allow researchers to predict outcomes based on previously assessed personality structure (e.g., COVID-19-related attitudes and behavior; Rammstedt et al., 2021) in order to investigate personality change based on repeated Big Five assessments (Lucas & Donnellan, 2011; Roemer et al., n.d.; Specht et al., 2011) and to draw stronger causal inferences (e.g., Anger et al., 2017; Sander et al., 2021).
Linked paradata in particular allow researchers to answer innovative research questions related to regional personality differences (e.g., Ebert et al., 2022; Obschonka et al., 2019) or differences in self-ratings depending on situation effects, for example, interviewer characteristics (e.g., Brunton‐Smith et al., 2017).
And finally, methodological questions, such as the effects of different assessment modes (e.g., Lang et al. 2011), acquiescence (e.g., Rammstedt et al., 2010), or response formats, can be answered by combining data from different studies.
Besides all the mentioned potentials and benefits of these high-quality large-scale data, such studies also suffer some drawbacks. For example, the included (personality) scales are usually only short scale measures with their limitations with regard to reliability and validity. In addition, per definition using secondary data does only allow to use the included constructs and their measures, which could undermine the fit for specific research questions. Also, the level of detail in the documentation of the survey programs varies. Finally, inflated error rates may occur when researchers use the same data to answer similar questions, or dependencies among research papers that may appear as presenting distinct evidence but are in fact based on the same data, or shared sampling bias and overfitting (for recent overviews, see, e.g., Mroczek et al., 2022; Thompson et al., 2020).
In this paper, we have tried to provide as comprehensive and complete an overview of the available surveys as possible. Because our search procedure was subjective in some regards, and was based partly on hearsay, we may have overlooked other available studies that would have met our criteria. To enable missed studies to be added, we have made our overview tables available in an OSF project (see Supplementary Materials), and any OSF user can post comments suggesting further surveys for inclusion.
As mentioned above, we restricted our overview to surveys conducted in Germany and focusing on the general adult population or on adult subpopulations. We are convinced that there is a similar need for a comparable overview of survey programs focusing, for example, on children and adolescents, or of survey programs conducted in countries other than Germany. For example, in addition to the two adult cohorts covered by the NEPS data sets included in this overview, NEPS provides additional data on the personality traits of primary and secondary school students. These data lend themselves to research on personality development and trait–outcome relationships, such as links between personality and achievement (e.g., N. D. Brandt et al., 2020; Lechner et al., 2017; Roemer et al., 2022).
In sum, with our paper we hope to inspire researchers to make more use of these valuable data sets, and to avail themselves of these data’s potential to answer important research questions.