The Big Five Personality Dimensions in Large-Scale Surveys: An Overview of 25 German Data Sets for Personality Research

In recent decades, the number of large-scale surveys that have included measures of the Big Five personality traits in their standard questionnaires has grown sharply both in Germany and internationally. Consequently, a vast, heterogeneous, high-quality data base is now readily available to personality psychologists for secondary analyses. In this paper, we provide an overview of 25 public large-scale surveys assessing the Big Five. Our aim is to increase researchers’ awareness of the availability and analytical potential of these data, and ultimately to increase their reuse. We restricted our selection to surveys of the adult population, conducted in Germany, based on probabilistic samples with a minimum sample size of 1,500 respondents, and assessing all Big Five dimensions with a validated Big Five instrument. We describe the study designs, the measures used to assess the Big Five, and the research potential of these valuable data. Relevance Statement In recent decades, the number of large-scale surveys that have included measures of the Big Five personality traits in their standard questionnaires has grown sharply both in Germany and internationally. Consequently, a vast, heterogeneous, high-quality data base is now readily available to personality psychologists for secondary analyses. In this paper, we provide an overview of 25 public large-scale surveys assessing the Big Five. Our aim is to increase researchers’ awareness of the availability and analytical potential of these data, and ultimately to increase their reuse. We restricted our selection to surveys of the adult population, conducted in Germany, based on probabilistic samples with a minimum sample size of 1,500 respondents, and assessing all Big Five dimensions with a validated Big Five instrument. We describe the study designs, the measures used to assess the Big Five, and the research potential of these valuable data. Key Insights overview of 25 public large-scale surveys assessing the Big Five description of analytical potential of these data aim to increase researchers’ awareness of the availability

In recent decades, the Big Five personality dimensions have become increasingly estab lished as a comprehensive framework to describe personality (e.g., John et al., 2008;McCrae & Costa, 2008).This has led to broad interest in their assessment, even in fields outside core personality research, such as sociology, economics, and epidemiology.Nowadays, the Big Five are included in most large-scale social surveys as an almost standard construct, like subjective well-being.
The data resulting from these large-scale social surveys are highly valuable for personality research.The public large-scale surveys include a variety of additional con structs, they rely mostly on population-representative or at least heterogeneous samples, and often follow a longitudinal design, allowing to address key research questions of personality psychologists.Moreover, these public large-scale surveys have numerous ad vantages in terms of sample size and sample quality compared to the typically self-con ducted small-scale studies still dominating personality psychology and adjacent fields.Self-conducted studies are usually based on small, selective samples of college students or-as it is increasingly common today-samples collected using Amazon Mechanical Turk (MTurk) (e.g., Webb & Tangney, 2022).As well-funded programs run by professio nals specialized in survey research methods, these surveys typically far exceed the scope of data collections that an individual researcher or research group could ever hope to carry out alone.Many of these survey programs comprise panel data or repeated cross-sectional data that enable longitudinal analyses, greatly expanding the type of research questions that can be answered and offering opportunities for causal inference (e.g., by using fixed-effects models).These data are usually freely available to personality psychologists (and other researchers) for secondary analyses.However, apart from a few highly prominent and widely used surveys, such as the German Socio-Economic Panel (SOEP; Goebel et al., 2019), most of the large-scale surveys presented in this paper do not appear to be widely known among personality psychologists and remain underutilized in current research.This is unfortunate because these surveys have enormous analytical potential for research on the development, consequences, or predictors of personality traits-a potential that has thus far lain largely dormant.Indeed, many of these surveys are like hidden gems that have yet to be discovered by personality psychologists.
The aim of the present paper is therefore to provide researchers in personality psy chology and beyond with an overview of these available and reusable data sets.Because they are so numerous, we deliberately limited this overview to surveys that (a) focused on the adult population, (b) were conducted in Germany, (c) were based on probability samples, (d) had a minimum sample size of about 1,500 respondents, and (e) included an assessment of all Big Five dimensions with a validated Big Five instrument.

Search Strategy
In the first step, we included surveys in our overview with which we were personally familiar by virtue of having worked extensively with them in the past.Second, to provide a more comprehensive and less subjective overview, we systematically searched the databases of the following German research data centers using the key words personality and Big Five: the GESIS Data Archive for the Social Sciences; the Research Data Center of the SOEP; the Research Data Centre of the German Centre for Higher Education Research and Science Studies (DZHW); the Research Data Center of the Federal Institute for Vocational Education and Training (BIBB); the Research Data Center of the Institute for Employment Research (IAB).Third, we screened well-known international survey programs for the inclusion of personality measures.Finally, we contacted experts affili ated with the identified surveys to solicit tips about additional data sources.These steps resulted in 25 data sets.Although this list may not be exhaustive, it does cover the most important and prominent large-scale surveys in Germany that include personality measures. 1 Big Five.These central aspects largely define the research questions that can be posed around the Big Five.In a next step, we sketch some research potentials that emerge from further characteristics in the surveys.In addition, Table 1 provides a structured overview of the substantive focus of each survey, and Table 2 summarizes details of their designs, sample sizes, personality measures, etc.

Research Design
Only a few of the identified survey programs are cross-sectional or repeated cross-sec tions in which independent samples are drawn for each wave (e.g., the German General Social Survey [ALLBUS], the World Values Survey [WVS]), whereas most of them are panel surveys following the same respondents over many years.In the period covered by the present paper, some of these panel surveys assessed the Big Five multiple times.For example, the Panel Analysis of Intimate Relationships and Family Dynamics (pairfam) and the SOEP reassessed the Big Five every four years, and the GESIS Panel did so yearly.

Sampling Design and Target Population
As noted earlier, we included only surveys that targeted either the general adult popula tion or adult subpopulations in Germany (with a minimum age of 15 years, but with the majority of the target population aged 18 years or over).These adult subpopulations included, for example, the elderly (the Survey of Health, Ageing and Retirement in Europe [SHARE]), (un-)employed persons (the Panel Study Labour Market and Social Security [PASS]; the BIBB/BAuA Employment Survey), and highly educated persons (the DZHW Graduate Panel and PhD Panel).Further, we included only large-scale surveys comprising at least 1,500 respondents.We also restricted our selection to surveys based on randomly selected respondents, so that they were representative of the corresponding population.Compared with non-random samples, such as convenience samples or quo ta samples, such random samples have important advantages from the perspective of representativeness and correct statistical inference to the population level (Lohr, 2021).Random sampling in these surveys was done via random-route procedures developed by the Arbeitskreis Deutscher Markt-und Sozialforschungsinstitute (ADM; e.g., the survey Personality and Voting Behavior 2003), was register-based (e.g., ALLBUS, the GESIS Pan el), or was a combination of both (e.g., the SOEP, the German Internet Panel [GIP]).For telephone surveys, established random digit dialing procedures for dual frame samples (e.g., Gabler et al., 2012) were used.In some cases, total universe samples of the specific target group were drawn (e.g., the DZHW PhD Panel).
Most of the longitudinal surveys included in our overview were regularly refreshed with new samples in order to address panel mortality.This ensured that their potential for longitudinal analyses was preserved despite dropout.Many survey programs provide multiple data versions that differ in their levels of sensitivity and potential for de-identification.Displayed in the column is the accessibility for the standard scientific use files; data versions that include more sensitive variable might entail higher access barriers (e.g., access via on-site use or remote desktops); which, however, are not further specified in this table.b Studies are part of international survey programmes.Thus, corresponding data from other countries are also available.

Thematic Foci of the Survey Programs
The thematic orientation differs greatly across the included survey programs.Most have more or less clear thematic foci.For example, pairfam focused on partnership and fertili ty; the German National Educational Panel Study (NEPS) on educational pathways and competence development; PASS on the labor market, poverty, and the welfare state; the BIBB/BAuA Employment Survey on qualification and working conditions; and SHARE on health and retirement.By contrast, the GESIS Panel, as an omnibus access panel, does not have a specific thematic focus, but rather includes a large variety of constructs according to the submitted modules (e.g., subjective health, environmental attitudes and behavior, attitudes toward refugees, social and political participation).All selected survey programs include a detailed assessment of sociodemographic background variables, such as education, socioeconomic status, income, and migration status.
Prompted by the COVID-19 pandemic, several surveys (e.g., GIP, the German Twin Life study, the GESIS Panel) included additional modules focusing on behavior, experien ces, and attitudes during the pandemic.

Big Five Measures
The selected surveys differ in the measures used to assess personality.Nearly every survey that fulfilled our criteria used a short-scale variant of the Big Inventory (BFI; John et al., 2008; German adaptation by Rammstedt, 1997).The most likely reason for this is the fact that short and ultra-short forms of the BFI are available and can be used free of charge for research purposes.
Surveys such as the GESIS Panel, GIP, NEPS2 , ALLBUS, SHARE, 1 and the WVS used the BFI-10 (Rammstedt & John, 2007), which assesses each Big Five dimension with one positively keyed and one negatively keyed item, thereby implicitly controlling for acquiescence.Other surveys included more comprehensive measures.For instance, the SOEP, TwinLife, the German National Academics Panel Study (Nacaps), and the BIBB/ BAuA Employment Survey used the Big Five Inventory-SOEP (BFI-S; Schupp & Gerlitz, 2008), a 15-item version of the BFI originally developed for the SOEP.Pairfam and PASS used the BFI-K (Rammstedt & John, 2005), a 20-item short form of the BFI.In one wave, the GESIS Panel also used the 30-item BFI-2-S (Rammstedt et al., 2020), a short scale version of the 60-item BFI-2 (Soto & John, 2017; German adaptation by Danner et al., 2019), a revised version of the BFI.Both the BFI-2 and the BFI-2-S allow the Big Five to be measured at both the domain and facet levels (three facets per domain).In a recent wave, the panel survey Family Research and Demographic Analysis (FReDA) used the even more abbreviated form of the BFI-2, the BFI-2-XS (Rammstedt et al., 2020).
Only one of the selected surveys-Personality and Voting Behavior 2003-included in addition to BFI scales a personality measure that did not hail from the BFI family, namely, the NEO Five-Factor Inventory (NEO-FFI; Costa & McCrae, 1989, German adap tation Borkenau & Ostendorf, 1991), the 60-item short form of the NEO Personality Inventory (NEO-PI; Costa & McCrae, 1992).The GESIS Panel and Personality and Voting Behavior 2003 included different Big Five measures, thus allowing comparisons across instruments.
Regarding the response scales used, the BFI-10, BFI-2-XS, BFI-K were always adminis tered with a 5-point rating scale as suggested by Rammstedt andJohn (2005, 2007) and Rammstedt et al. (2020).The BFI-s was mostly administered using a 7-point rating scale (i.e., SOEP, Nacaps, NAKO,PIAAC-L, TwinLife).In one other study using the BFI-S (i.e., BIBB-BAuA), however, a 5-point scale was used.The NEO-FFI was also assessed using a 7-point rating scale.While response scales were generally directed from disagreement to agreement, the response scales used in the ALLBUS and ISJP were oriented in the opposite direction (i.e., from agreement to disagreement). of our knowledge, not included in any of the survey (yet, the younger cohorts of TwinLife-which did not meet our inclusion criteria-included parent-reports about the personality of the target person (i.e., the child)).

Big Five Assessment Mode
The selected survey programs differ in their assessment modes.In some cases, assess ment modes even differ among respondents of the same survey according to their assessment mode preferences (e.g., web-based or paper-and-pencil questionnaire; e.g., the GESIS Panel, GIP).In other cases, assessment modes differ over time/across assessments, because in one year the assessment was conducted as a personal interview and in other years as a telephone interview or a web-based questionnaire (e.g., NEPS, the SOEP, Twin Life).In this overview, we focus on the mode(s) of the Big Five assessment (see Table 2).The Big Five were commonly assessed in the form of a personal interview (with an interviewer reading out each question and coding the answer; e.g., PASS, SHARE, NEPS Starting Cohort 6).In other surveys, Big Five questionnaires were self-administered (without an interviewer present; e.g., the GESIS Panel, TwinLife, GIP).

Further Research Potential
Besides the analysis of associations between the Big Five and various outcome variables, associations among partners/household members, and potential longitudinal effects or methodological differences among instruments, samples, and modes, the data sets also have further research potential.

Paradata
For most of the selected survey programs, some form of paradata (i.e., data describing the data collection process) are provided.These may include the assessment date, the as sessment duration, regional information, information about the assessment itself (where it took place, if others were present, etc.), or information about the interviewer (i.e., the person conducting the interview and recording the answers; e.g., SHARE).Such paradata can be used for both methodological (e.g., Cheng et al., 2020) and substantive analyses (allowing, e.g., analyses of the effects of weather on personality self-reports; see Rammstedt et al., 2015).
Regional information in particular allows survey data to be merged with geodata-for example, on pollution, regional wealth, or regional political orientation-which offers wide analytical potential (e.g., Ebert et al., 2022; for a general overview, see Bluemke et al., 2017).
Big Five Personality Dimensions in Large-Scale Surveys

International Survey Programs
Although most of the survey programs included in Table 1 are national surveys conduc ted in Germany only, some (e.g., SHARE, the International Social Survey Programme [ISSP], the WVS, the International Social Justice Project [ISJP]) are part of international comparative survey programs.In these cases, cross-nationally comparative analyses for the Big Five are possible (e.g., Levinsky et al., 2019;Rammstedt et al., 2013;Schmitt et al., 2008).

Replication and Integrative Data Analysis
Beyond their individual value as data sources, the selected survey programs offer unpre cedented analytical potential when combined to answer a specific research question.Researchers can fruitfully combine multiple data sources in different ways.For example, they can conduct independent tests of the same hypothesis in multiple data sets to ascertain whether the results replicate across studies and are robust to variations in study design, measures, sample composition, and other survey characteristics.In some cases, it may even be possible to use meta-analytical techniques to combine results ob tained in separate samples.This will contribute to building a more robust and replicable body of evidence in personality psychology.Additionally, for some research questions, researchers might want to pool and harmonize several data sources in order to use them for an integrative data analysis (see Curran & Hussong, 2009;Curran et al., 2008).Among other advantages, this may be a useful way of increasing statistical power or improving the coverage of certain sociodemographic subgroups or geographical units.Such a megaanalysis was for example conducted based on ten panel studies (also including the SOEP) to investigate the prospective associations of the Big Five with several life outcomes (Beck & Jackson, 2022).

Conclusion
The present paper aimed to provide personality researchers with an overview of datasets from large-scale surveys in Germany that include measures of the Big Five.By that we aimed to increase the awareness and interest of psychologists-usually trained in primary data assessment and usage-in reusing these available high-quality datasets as they provide a broad research potential and clear methodological advantages compared to the typically used small-scale selective samples.This potential includes, on the one hand, substantive issues, such as concurrent associations between the Big Five and a broad variety of outcome variables (e.g., Denissen et al., 2018) or personality change over time and across cohorts.Also of interest from a personality psychology point of view are associations between the Big Five and the additional psychological constructs assessed (e.g., intelligence; see, e.g., Rammstedt et al., 2016), or similarities between personality self-ratings of target persons and their partners (see, e.g., Rammstedt & Schupp, 2008) or (other) household members.
On the other hand, the available longitudinal data allow researchers to predict outcomes based on previously assessed personality structure (e.g., COVID-19-related attitudes and behavior; Rammstedt et al., 2021) in order to investigate personality change based on repeated Big Five assessments (Lucas & Donnellan, 2011;Roemer et al., n.d.;Specht et al., 2011) and to draw stronger causal inferences (e.g., Anger et al., 2017;Sander et al., 2021).
Linked paradata in particular allow researchers to answer innovative research ques tions related to regional personality differences (e.g., Ebert et al., 2022;Obschonka et al., 2019) or differences in self-ratings depending on situation effects, for example, interview er characteristics (e.g., Brunton-Smith et al., 2017).
And finally, methodological questions, such as the effects of different assessment modes (e.g., Lang et al. 2011), acquiescence (e.g., Rammstedt et al., 2010), or response formats, can be answered by combining data from different studies.
Besides all the mentioned potentials and benefits of these high-quality large-scale data, such studies also suffer some drawbacks.For example, the included (personality) scales are usually only short scale measures with their limitations with regard to reliabil ity and validity.In addition, per definition using secondary data does only allow to use the included constructs and their measures, which could undermine the fit for specific research questions.Also, the level of detail in the documentation of the survey programs varies.Finally, inflated error rates may occur when researchers use the same data to answer similar questions, or dependencies among research papers that may appear as presenting distinct evidence but are in fact based on the same data, or shared sampling bias and overfitting (for recent overviews, see, e.g., Mroczek et al., 2022;Thompson et al., 2020).
In this paper, we have tried to provide as comprehensive and complete an overview of the available surveys as possible.Because our search procedure was subjective in some regards, and was based partly on hearsay, we may have overlooked other available studies that would have met our criteria.To enable missed studies to be added, we have made our overview tables available in an OSF project (see Supplementary Materials), and any OSF user can post comments suggesting further surveys for inclusion.
As mentioned above, we restricted our overview to surveys conducted in Germany and focusing on the general adult population or on adult subpopulations.We are con vinced that there is a similar need for a comparable overview of survey programs focusing, for example, on children and adolescents, or of survey programs conducted in countries other than Germany.For example, in addition to the two adult cohorts covered by the NEPS data sets included in this overview, NEPS provides additional data on the personality traits of primary and secondary school students.These data lend themselves to research on personality development and trait-outcome relationships, such as links Note.ALLBUS = Allgemeine Bevölkerungsumfrage der Sozialwissenschaften [German General Social Survey]; BIBB/BAuA: BIBB = Bundesinstitut für Berufsbildung [Federal Institute for Vocational Education and Training]; BAuA= Bundesanstalt für Arbeitsschutz und Arbeitsmedizin [Federal Institute for Occupational Safety and Health]; DZHW = Deutsche Zentrum für Hochschul-und Wissenschaftsforschung [German Centre for Higher Education Research and Science Studies]; FReDA = Family Research and Demographic Analysis-the German Family Demography Panel Study; GGS = the Generations and Gender Survey; GIP = the German Internet Panel; GLES = the German Longitudinal Election Study; SOEP = the German Socio-Economic Panel; ISJP = the International Social Justice Project; ISSP = the International Social Survey Programme; Nacaps = the National Academics Panel Study; NAKO = Nationale Kohorte [German National Cohort]; NEPS = the German National Educational Panel Study; pairfam = Panel Analysis of Intimate Relationships and Family Dynamics-the German Family Panel; PASS = Panel Arbeitsmarkt und soziale Sicherung [Panel Study Labour Market and Social Security]; IAB = Institut für Arbeitsmarkt-und Berufsforschung [Institute for Employment Research (the research institute of the Federal Employment Agency]; PIAAC-L = the Programme for the International Assessment of Adult Competencies Longitudinal; SHARE = the Survey of Health, Ageing and Retirement in Europe; WVS = the World Values Survey.a . 4, Article e10769 https://doi. = Allgemeine Bevölkerungsumfrage der Sozialwissenschaften [German General Social Survey]; BIBB/BAuA: BIBB = Bundesinstitut für Berufsbildung [Federal Institute for Vocational Education and Training]; BAuA= Bundesanstalt für Arbeitsschutz und Arbeitsmedizin [Federal Institute for Occupational Safety and Health]; DZHW = Deutsche Zentrum für Hochschul-und Wissenschaftsforschung [German Centre for Higher Education Research and Science Studies]; FReDA = Family Research and Demographic Analysis-The German Family Demography Panel Study; GGS = Generations and Gender Survey; GIP = the German Internet Panel; GLES = the German Longitudinal Election Study; SOEP = the German Socio-Economic Panel; ISJP = the International Social Justice Project; NACAPS = the National Academics Panel Study; NAKO = Nationale Kohorte [German National Cohort]; NEPS = the National Educational Panel Study; pairfam = the Panel Analysis of Intimate Relationships and Family Dynamics; PASS = Panel Arbeitsmarkt und soziale Sicherung [Panel Study Labour Market and Social Security]; IAB = Institut für Arbeitsmarkt-und Berufsforschung [Institute for Employment Research, the research institute of the Federal Employment Agency]; PIAAC-L = Programme for the International Assessment of Adult Competencies Longitudinal; SHARE = the Survey of Health, Ageing and Retirement in Europe; WVS = the World Values Survey.BFI-10 = the Ten-Item Big Five Inventory; BFI-S = the Big Five Inventory-SOEP; BFI-2-XS = the extra-short form of the Big Five Inventory-2 (BFI-2); BFI-2-S = the short form of the Big Five Inventory-2; BFI-K = the German-language short form of the Big Five Inventory (BFI); NEO-FFI = the NEO Five-Factor Inventory.CAWI = computer-assisted web interview; CAPI = computer-assisted personal interview; CASI = computer-assisted self-interview; CATI = computer-assisted telephone interview; SAQP = self-administered questionnaire, paper; PAPI = paper-and-pencil interview.a The depicted N for FReDA is from the recruitment wave.Personality was (first) assessed in 2022, in wave 2B.The data of wave 2B is planned to be released in 2024.b The German Internet Panel (GIP) includes three cohorts, initially sampled in 2012, 2014, and 2018.The 2012 and 2014 cohorts are household samples; the 2018 cohort is a person sample.In 2018, the BFI-10 was administered only to the newly recruited participants.c TwinLife includes three additional cohorts that comprise families with twins aged 5 years (Cohort 1), families with twins aged 11 years (Cohort 2), and families with twins aged 17 years (Cohort 3).On the first measurement occasion, the total N was 14,413.

Table 2
Detailed Overview of the Included Data Sets and the Assessment of the Big Five