NEUROTATARLAR | НЕЙРОТАТАРЛАР
Popular repositories Loading
-
awesome-tatar
awesome-tatar Public😎Awesome list about everything in Tatar 🌱Искиткеч татар галәме исемлеге
-
gemini-pdf-extractor
gemini-pdf-extractor PublicAutomated extraction of structured data from Tatar-language PDF documents using Google Gemini and Yandex Disk public links. Supports chunked processing, prompt engineering, and JSON output validation.
Python 4
-
Tatar-speech-tools
Tatar-speech-tools PublicPipelines, crawlers and tools for mining Speech-to-Text corpus for Tatar language
-
-
monocorpus
monocorpus PublicThe Monocorpus project is a collection of tools designed to facilitate the development of a Tatar language monocorpus
Python
-
Repositories
- gec-tt Public
Streaming Tatar (Cyrillic) grammar correction app with a Flutter client and FastAPI backend. SSE-based corrections, rate limiting, metrics, and a pluggable model adapter. Designed for low‑latency text correction and clear side‑by‑side editing.
neurotatarlar/gec-tt’s past year of commit activity - gec-annotations-filter Public
Tatar GEC data pipeline: ingests social media text, cleans/dedups, scores toxicity and GEC usefulness (Gemini), supports incremental updates, and exports curated annotation batches with proportional error-density sampling.
neurotatarlar/gec-annotations-filter’s past year of commit activity - gec-annotation-platform Public
Annotation platform for grammatical error correction (GEC) tasks, initially focused on the Tatar language.
neurotatarlar/gec-annotation-platform’s past year of commit activity - monocorpus Public
The Monocorpus project is a collection of tools designed to facilitate the development of a Tatar language monocorpus
neurotatarlar/monocorpus’s past year of commit activity - gemini-pdf-extractor Public
Automated extraction of structured data from Tatar-language PDF documents using Google Gemini and Yandex Disk public links. Supports chunked processing, prompt engineering, and JSON output validation.
neurotatarlar/gemini-pdf-extractor’s past year of commit activity - tahrirgoh Public Forked from tahrirchi/tahrirgoh
Tahrirgoh is a web platform for dataset collection for the Grammatical Error Correction (GEC) task. The only difference from original is translation of interface to Tatar language
neurotatarlar/tahrirgoh’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…