| Issue | EPL, Volume 152, Number 2, October 2025 |
|---|---|
| Article Number | 22003 |
| Number of page(s) | 7 |
| Section | Mathematical and interdisciplinary physics |
| DOI | https://doi.org/10.1209/0295-5075/ae1253 |
| Published online | 04 November 2025 |
Core-periphery patterns in knowledge graphs reveal digital visibility hierarchies of South American languages
1 Universidad Tecnológica del Perú - Lima, Perú
2 Universidad Continental - Lima, Perú
Received: 14 August 2025
Accepted: 13 October 2025
Abstract
We analyze the digital representation of indigenous South American languages through a statistical-physics perspective, applying onion decomposition to a semantically enriched knowledge graph built from Wikipedia, Wikidata, and Glottolog. This multi-scale method reveals a pronounced core-periphery hierarchy with three distinct regions: i) a compact core of highly connected languages dominating digital visibility, ii) intermediate layers with strong family-level clustering, and iii) a sparse periphery of linguistically isolated languages. Quantitatively, core languages have, on average, ten times the degree of peripheral ones and are seven times more likely to possess complete genealogical metadata. The results provide measurable evidence of systemic biases in the digital documentation of cultural heritage. Beyond diagnosis, the analysis identifies strategic targets for intervention: intermediate-layer languages as potential bridges to enhance peripheral visibility, and completely isolated cases as priorities for urgent preservation. This approach offers a transferable framework for quantifying representational inequalities in cultural knowledge systems.
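The onion decomposition named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `onion_decomposition`, the adjacency-dictionary input, and the toy graph with placeholder node labels are all assumptions made for illustration, and no data from the article is used. The procedure peels the graph in rounds. Each round removes every node whose remaining degree is at most the current core value, and the round index is that node's onion layer, so low layers correspond to the periphery and the highest layers to the core.

```python
def onion_decomposition(adjacency):
    """Return (layer, coreness) dicts for an undirected simple graph.

    `adjacency` maps each node to the set of its neighbours.
    Hypothetical helper written for illustration only.
    """
    # Work on a copy so the caller's graph is left untouched.
    remaining = {node: set(nbrs) for node, nbrs in adjacency.items()}
    layer, coreness = {}, {}
    current_core, current_layer = 0, 1

    while remaining:
        # The core value never decreases as outer layers are peeled away.
        min_degree = min(len(nbrs) for nbrs in remaining.values())
        current_core = max(current_core, min_degree)

        # All nodes at or below the current core value leave in the same round.
        peel = [n for n, nbrs in remaining.items() if len(nbrs) <= current_core]
        for node in peel:
            layer[node] = current_layer
            coreness[node] = current_core

        # Remove the peeled nodes and update the degrees of their neighbours.
        for node in peel:
            for nbr in remaining.pop(node):
                if nbr in remaining:
                    remaining[nbr].discard(node)
        current_layer += 1

    return layer, coreness


# Toy graph (placeholder labels, not data from the article):
# a triangle a-b-c, a chain a-d-e hanging off it, and an isolated node f.
toy = {
    "a": {"b", "c", "d"},
    "b": {"a", "c"},
    "c": {"a", "b"},
    "d": {"a", "e"},
    "e": {"d"},
    "f": set(),
}
layer, coreness = onion_decomposition(toy)
```

In this toy graph the isolated node `f` is peeled first (layer 1), the chain nodes `e` and `d` follow in layers 2 and 3, and the triangle `a`, `b`, `c` forms the innermost layer 4 with coreness 2. This mirrors, at a very small scale, the periphery, intermediate, and core regions described above.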
© 2025 EPLA. All rights, including for text and data mining, AI training, and similar technologies, are reserved
