Yesterday, we published an English – Kabyle Parallel Corpus.
130 883 aligned sentence pairs extracted from our contributions and contributions of the community on Tatoeba database.
The corpus is aligned pair by sentence-id (en-kab).
By number of sentences, Kabyle language is ranked 5th on Tatoeba with 772 002 submitted sentences (september 14th, 2025).
The dataset will be updated from time to time via :
HF dataset : https://huggingface.co/datasets/Imsidag-community/english-kabyle-parallel