StyleSurvey

We welcome additions via Pull Requests.

Predefined features (stylometry)

Python tools

Tool Notes / Link
LIWC Pennebaker et al., 2015
LFTK Lee and Lee, 2023
NeuroBiber Alkiek et al., 2025
Multidimensional Analysis Tool (MAT) Nini, 2019
StyloSpeaker (for speakers in speech transcripts) Aggazzotti et al., 2025
Writeprints+ (PAN authorship verification) Weerasinghe and Greenstadt, 2020
PAN style change detection Strøm, 2021, Zuo et al., 2019, Zlatkova et al., 2018

Other less comprehensive stylometry tools

Tool Notes / Link
Classifying writing styles within a document Elahi and Muneer, 2018
Supervised Stylometry (SuperStyl) Camps and Cafiero, 2024

Non-Python tools

Tool Notes / Link
Stylo (R) Eder et al., 2016
JStylo (Java) PSAL, 2013
Coh-Metrix (for text cohesion and readability) Graesser et al., 2004
Signature Millican, 2003

Tools for languages other than English

Language Notes / Link
Spanish Carreras-Riudavets et al., 2025 (paper)
Multilingual Stylometry (Python and R) Schöch et al., 2024
Historical Persian Farsi et al., 2025 (paper)
Cross-language/bilingual DT-grams Murauer & Specht, 2021 (paper)

Papers using stylometry for languages other than English (no tools released)

| Language | Notes / Paper | | —- | ———— | | Latin | POS tag features Chen et al., 2024 | | Urdu | stylometric Nazir et al., 2021 | | Bengali | stylometric, n-grams Hossain et al., 2020 | | Hinglish | n-grams Sharma et al., 2018 | | Chinese | tones/rimes Hou & Huang, 2019, character/rhyme/genre/overlapped words Tang et al., 2019 | | Arabic poetry | stylometric Ahmed et al., 2019 | | EN/FR/IT/SP | n-grams, word2vec, TFIDF (PAN 2019) Rahgouy et al., 2019 |

Automatically Learned Representations

Huggingface Models

Model Link
CISR https://huggingface.co/AnnaWegmann/Style-Embedding
StyleDistance https://huggingface.co/StyleDistance/styledistance
mStyleDistance https://huggingface.co/StyleDistance/mstyledistance
LUAR https://huggingface.co/rrivera1849/LUAR-MUD
Multilingual Style Representation https://huggingface.co/Blablablab/multilingual-style-representation-Llama-3.2

Other Models

Model Link
LISA https://ajayp.app/posts/2023/11/learning-interpretable-embeddings-via-llms/

Explanation tools

Tool Notes / Link
Latent Style Interpretation GitHub, HuggingFace demo

Scripts

Project Link
Learning Invariant Representations of Social Media Users https://github.com/noa/iur
A Deep Metric Learning Approach to Account Linking https://github.com/noa/naacl2021
Style is NOT a single variable: Case Studies for Cross-Style Language Understanding https://github.com/dykang/xslue
Does It Capture STEL? A Modular, Similarity-based Linguistic Style Evaluation Framework. https://github.com/nlpsoc/STEL
On the State of the Art in Authorship Attribution and Authorship Verification https://github.com/JacobTyo/Valla
On the State of the Art in Authorship Attribution and Authorship Verification https://github.com/JacobTyo/Valla