Don’t pull any old personality taxonomy from the shelf: The performance of historical and sample derived taxonomies in extracting personality information from text [Author Accepted Manuscript]
Author(s) / Creator(s)
Karl, Johannes A.
Fischer, Ronald
Abstract / Description
Substantial efforts have been made to develop comprehensive taxonomies of personality traits in many languages. Nevertheless, given that what is important and salient in individuals’ lived experience might be changing over time, this raises the question about the long-term usefulness of ‘off-the-shelf’ taxonomies developed decades ago. In the current study we used a bottom-up approach to create a large sample-specific taxonomy of personality terms. We subsequently examined the overlap and sensitivity of this taxonomy compared to an established trait taxonomy in the same language. Overall, we found that the two taxonomies only showed limited overlap with a pronounced divergence in emotionality (Neuroticism) and social aspects (Agreeableness) of personality. In addition to this, we found that while the personality assessment extracted from self-descriptions using the established taxonomy showed alignment with participants’ self-rated personality, especially Extraversion, Agreeableness, and Neuroticism, the sample-specific taxonomy showed a significantly greater alignment between text-based and self-rated personality. In summary, our current study highlights the need to extend our thinking about the psycholexical hypothesis, moving away from assumptions of time invariant language encoding to more explicitly recognizing temporal and sample-specific dynamics underpinning the expression and use of personality trait terms.
Keyword(s)
lexical hypothesis
text-based personality assessment
text mining
sample specific taxonomy
Big Five
Date of first publication
2026-01-20
Journal title
Measurement Instruments for the Social Sciences
Publisher
PsychArchives
Publication status
acceptedVersion
Review status
reviewed
Citation
Karl, J. A., & Fischer, R. (in press). Don’t pull any old personality taxonomy from the shelf: The performance of historical and sample derived taxonomies in extracting personality information from text [Author Accepted Manuscript]. Measurement Instruments for the Social Sciences. https://doi.org/10.23668/psycharchives.21591
-
Karl_Fischer_2026_Sample_derived_personality_taxonomy_MISS_AAM.pdf
Adobe PDF - 576.53 KB
MD5: 1a2ee353cf1ff5dc15b0a8bff0aa11a7
Description: Accepted Manuscript
-
There are no other versions of this object.
-
PsychArchives acquisition timestamp
2026-01-20T13:23:46Z
-
Made available on
2026-01-20T13:23:46Z
-
ISSN
2523-8930
-
Persistent Identifier
https://hdl.handle.net/20.500.12034/16975
https://doi.org/10.23668/psycharchives.21591
-
Language of content
eng
-
Is version of
https://osf.io/preprints/psyarxiv/muyc2_v1
https://doi.org/10.5964/miss.16869
-
Dewey Decimal Classification number(s)
150
-
DRO type
article
-
Visible tag(s)
PsychOpen GOLD
Accepted Manuscript