About Shekar

An open and community-driven ecosystem for advancing Persian Natural Language Processing.

Shekar is an open-source initiative focused on building high-quality tools, datasets, and models for Persian NLP. The project is designed to support researchers, developers, and students who work with Persian-language data in academic and real-world settings.

Our goal is to remove barriers to working with Persian by providing modern, well-documented, and freely available resources that meet current research and engineering standards.

Principles

Shekar is built around openness, reproducibility, and long-term sustainability. All components are released under permissive licenses, and the project emphasizes clean data, transparent evaluation, and practical usability.