StyleSurvey

We welcome additions via Pull Requests.

Predefined features (stylometry)

Python tools

Tool | Notes / Link
LIWC | Pennebaker et al., 2015
LFTK | Lee and Lee, 2023
NeuroBiber | Alkiek et al., 2025
Multidimensional Analysis Tool (MAT) | Nini, 2019
StyloSpeaker (for speakers in speech transcripts) | Aggazzotti et al., 2025
Writeprints+ (PAN authorship verification) | Weerasinghe and Greenstadt, 2020
PAN style change detection | Strøm, 2021; Zuo et al., 2019; Zlatkova et al., 2018
elfen (Efficient Linguistic Feature Extraction) | Maurer, 2026 (paper)
gram2vec | Zeng et al., 2025 (paper)
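
To give a sense of what "predefined features" means in practice, here is a minimal sketch that computes a handful of hand-crafted style features using only the Python standard library. It is not the API of any tool listed above; the function name `stylometric_features`, the feature names, and the tiny function-word list are assumptions made for illustration. The listed tools cover far larger and better validated feature inventories.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny function-word list (illustration only).
FUNCTION_WORDS = ("the", "of", "and", "to", "a", "in", "that", "it", "is", "i")


def stylometric_features(text: str) -> dict[str, float]:
    """Return a few hand-crafted style features for a single text."""
    # Lowercased word tokens; apostrophes are kept so "don't" stays one token.
    tokens = re.findall(r"[A-Za-z']+", text.lower())
    # Naive sentence split on runs of terminal punctuation.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    n_tokens = len(tokens) or 1  # avoid division by zero on empty input
    counts = Counter(tokens)

    features = {
        "avg_word_length": sum(len(t) for t in tokens) / n_tokens,
        "type_token_ratio": len(counts) / n_tokens,
        "avg_sentence_length": len(tokens) / (len(sentences) or 1),
        "punctuation_rate": sum(text.count(p) for p in ",;:!?") / (len(text) or 1),
    }
    # Relative frequency of each function word.
    for word in FUNCTION_WORDS:
        features[f"fw_{word}"] = counts[word] / n_tokens
    return features


if __name__ == "__main__":
    sample = "I think that it is fine. Is it, though? The cat sat in the sun."
    for name, value in stylometric_features(sample).items():
        print(f"{name}: {value:.3f}")
```

The resulting dictionary can be turned into a fixed-length vector and passed to any classifier or distance measure, which is roughly the workflow the tools in this section support at a much larger scale.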

Other less comprehensive stylometry tools

Tool | Notes / Link
Classifying writing styles within a document | Elahi and Muneer, 2018
Supervised Stylometry (SuperStyl) | Camps and Cafiero, 2024

Non-Python tools

Tool | Notes / Link
Stylo (R) | Eder et al., 2016
JStylo (Java) | PSAL, 2013
Coh-Metrix (for text cohesion and readability) | Graesser et al., 2004
Signature | Millican, 2003

Tools for languages other than English

Language | Tool | Paper
Spanish | Tulengua | Carreras-Riudavets et al., 2025
Multilingual | Stylometry (Python and R) | D33 (CLS-INFRA) Schöch et al., 2024
Historical Persian | parsipy | Farsi et al., 2025
Cross-language/bilingual | DT-grams | Murauer & Specht, 2021

Papers using stylometry for languages other than English (no tools released)

Language | Features | Paper
Latin | POS tag features | Chen et al., 2024
Urdu | stylometric | Nazir et al., 2021
Bengali | stylometric, n-grams | Hossain et al., 2020
Hinglish | n-grams | Sharma et al., 2018
Chinese | tones/rimes | Hou & Huang, 2019
Chinese | character/rhyme/genre/overlapped words | Tang et al., 2019
Arabic | poetry stylometric | Ahmed et al., 2019
EN/FR/IT/SP | n-grams, word2vec, TFIDF (PAN 2019) | Rahgouy et al., 2019

Automatically Learned Representations

Hugging Face Models

Model | Link
CISR | https://huggingface.co/AnnaWegmann/Style-Embedding
StyleDistance | https://huggingface.co/StyleDistance/styledistance
mStyleDistance | https://huggingface.co/StyleDistance/mstyledistance
LUAR | https://huggingface.co/rrivera1849/LUAR-MUD
Multilingual Style Representation | https://huggingface.co/Blablablab/multilingual-style-representation-Llama-3.2
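
Models like these are typically used by embedding two texts and comparing the vectors. The sketch below shows that pattern with a plain cosine similarity. The helper names (`cosine_similarity`, `style_similarity`, `load_sentence_transformer_encoder`) are hypothetical, and loading a model through the `sentence-transformers` package is an assumption that may not hold for every entry in the table; check each model card for its recommended loading code.

```python
import math
from typing import Callable, Sequence

Vector = Sequence[float]


def cosine_similarity(u: Vector, v: Vector) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either has zero norm)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return float(dot / (norm_u * norm_v))


def style_similarity(
    encode: Callable[[list[str]], Sequence[Vector]], text_a: str, text_b: str
) -> float:
    """Embed both texts with `encode` and compare the two embeddings."""
    emb_a, emb_b = encode([text_a, text_b])
    return cosine_similarity(emb_a, emb_b)


def load_sentence_transformer_encoder(model_name: str = "AnnaWegmann/Style-Embedding"):
    """Assumption: the model loads via sentence-transformers.

    Requires `pip install sentence-transformers` and network access,
    since the weights are downloaded on first use.
    """
    from sentence_transformers import SentenceTransformer  # lazy import

    return SentenceTransformer(model_name).encode
```

With the package installed, `style_similarity(load_sentence_transformer_encoder(), text_a, text_b)` would return a score where higher values suggest more similar style under that model. Passing the encoder in as a callable keeps the comparison logic independent of how a particular model is loaded.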

Other Models

Model | Link
LISA | https://ajayp.app/posts/2023/11/learning-interpretable-embeddings-via-llms/

Explanation tools

Tool | Notes / Link
Latent Style Interpretation | GitHub, Hugging Face demo

Scripts

Project | Link
Learning Invariant Representations of Social Media Users | https://github.com/noa/iur
A Deep Metric Learning Approach to Account Linking | https://github.com/noa/naacl2021
Style is NOT a single variable: Case Studies for Cross-Style Language Understanding | https://github.com/dykang/xslue
Does It Capture STEL? A Modular, Similarity-based Linguistic Style Evaluation Framework | https://github.com/nlpsoc/STEL
On the State of the Art in Authorship Attribution and Authorship Verification | https://github.com/JacobTyo/Valla