Research Papers
Pratilekha: Lightweight Indic Speech Recognition
Anuran Roy, Alchemyst Labs
A family of lightweight Indic speech-to-text (STT) models supporting 12 Indian languages with sub-200 ms latency on edge devices, achieving a ~20% relative improvement over base Whisper on Indic benchmarks.
STT, Indic Languages, LoRA, Edge ML
Is large context window all you need?
Anuran Roy, Saptarshi Pani, Arnab Sengupta, Alchemyst Labs
This paper examines the trade-off between context size and latency, highlighting the need for better context retrieval strategies that do not bloat the queries sent to Large Language Models.
Context Engineering, Language Models, LLMs