hyperconsolidated (manual) - the consolidated sheet, further consolidated along the following guidelines: for the top 15 languages*, the total counts include any further "Language" entries with at least 25 records that belong to one or more of the top languages. (for example: the language entry "de" was added to the total for "German"; the language entry "French-Russian" was added to the totals for both "French" and "Russian".) the point of this exercise was to give an arguably more accurate count for the most common languages on Library Genesis, with the major caveats that 1) the consolidation was done manually & therefore counts are probably not accurate, and 2) libgen's metadata is riddled with errors & therefore counts are certainly not accurate. from this flawed data, uploads in each top language have been calculated as percentages of all uploads to the main (non-fiction) & fiction databases. in this dataset, almost 99% of all uploads are in just 14 languages.
overview-languages_2021-11-27.ods - the same workbook as overview-languages_2021-11-27.xlsx, saved in OpenDocument Spreadsheet (.ods) format.
* the #15 overall "Language" entry was Russian(Old), which was combined with Russian for the hyperconsolidated (manual) totals, so the top 15 entries cover 14 languages.
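
the hyperconsolidation rule described above can be sketched in python. this is only an illustration of the rule, not the process actually used (the real consolidation was done by hand in the spreadsheet). the function names, the entry-to-language mapping and every count below are hypothetical; only the 25-record threshold and the "de" / "French-Russian" / "Russian(Old)" examples come from the notes.

```python
MIN_RECORDS = 25  # threshold stated in the sheet description


def hyperconsolidate(entry_counts, entry_to_languages, top_languages,
                     min_records=MIN_RECORDS):
    """Add qualifying "Language" entries to the totals of the top languages.

    entry_counts: raw "Language" entry -> number of records
    entry_to_languages: hand-made mapping, entry -> top languages it belongs to
    top_languages: the languages whose totals are being built
    """
    # Start each total from the language's own entry (0 if it has none).
    totals = {lang: entry_counts.get(lang, 0) for lang in top_languages}
    for entry, count in entry_counts.items():
        if entry in totals:
            continue  # the language's own entry is already counted
        if count < min_records:
            continue  # entries under the threshold are left out
        for lang in entry_to_languages.get(entry, ()):
            if lang in totals:
                # A multi-language entry is added to every language it names.
                totals[lang] += count
    return totals


def percentages(totals, all_uploads):
    """Express each total as a percentage of all uploads."""
    return {lang: 100.0 * n / all_uploads for lang, n in totals.items()}


# Hypothetical counts, made up for illustration only.
entry_counts = {
    "German": 1000, "de": 40, "ger": 10,
    "French": 500, "French-Russian": 30,
    "Russian": 2000, "Russian(Old)": 60,
    "Latin": 5,
}
# Hypothetical hand-made mapping of variant entries to top languages.
entry_to_languages = {
    "de": ["German"],
    "ger": ["German"],
    "French-Russian": ["French", "Russian"],
    "Russian(Old)": ["Russian"],
}
top_languages = ["German", "French", "Russian"]

totals = hyperconsolidate(entry_counts, entry_to_languages, top_languages)
all_uploads = sum(entry_counts.values())
shares = percentages(totals, all_uploads)
print(totals)
```

with these made-up numbers, "ger" is skipped because it has fewer than 25 records, and "French-Russian" is counted once for French and once for Russian. that double counting means the per-language totals can add up to more than the number of records, which is one more reason to treat the percentages as rough.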