Projects

Open-source libraries, datasets, and models built to advance Persian NLP.

Shekar Python Library

A high-performance Persian NLP library providing tokenization, normalization, POS tagging, embeddings, and autocorrection using large-scale curated corpora.

Neyshekar

A community-driven voice collection platform for Persian speech, designed to support ASR research and low-resource language modeling.