Bangladesh faces hurdles in creating its own ChatGPT-level language model

Bangladesh is making strides with new Bangla language tools, but faces steep challenges developing full-scale Artificial Intelligence like ChatGPT as global tech giants dominate the field.

Bangla, with its complex grammar and numerous dialects, poses unique challenges for writers and language technologists. The surge of poorly written online content in Bangla further complicates matters, impacting the ability of international Artificial Intelligence systems, such as ChatGPT and Gemini, to generate accurate Bangla. In response, local innovators have stepped in. Shuddhi, developed by Sagor Sarker of Hishab Technologies, is a tool aimed at everyday Bangla writing, correcting grammar and spelling, supporting Banglish, and offering translation. Sarker envisions it as a ´Grammarly for Bangla,´ pending the acquisition of funding needed to develop additional features like paraphrasing, plagiarism detection, and fact-checking.

Hishab Technologies is also experimenting with homegrown large language models (LLMs). Their bilingual ´titulm-1b-bn-v1,´ trained on over a billion parameters and billions of Bangla tokens, handles both Bangla and English tasks, while drawing on popular model architectures like Llama, MPT, and Gemma. Public evaluation datasets for Bangla LLMs have also been produced to spur broader development within the ecosystem. Meanwhile, Intelsense AI has launched ´Ekush LLM,´ which touts itself as Bangladesh’s first model trained specifically on Bangla. Beyond text generation, Ekush leverages proprietary Voice Artificial Intelligence, with applications across telemedicine, agriculture, and more. These startups collaborate with local enterprises, aiming not only to improve workflow but to foster understanding and funding for Artificial Intelligence research in the country.

Despite visible progress, experts suggest Bangladesh’s path to homegrown, general-purpose LLMs like GPT-4 remains encumbered by infrastructure and resource limitations. Training state-of-the-art models—requiring thousands of advanced GPUs, vast datasets, and reliable electricity—is financially and technically formidable for local players. Most Bangladeshi companies instead fine-tune open-source models for narrow, domain-specific use cases or rely on API offerings from global leaders. Data privacy, too, is a rising concern, with the nation lacking robust frameworks akin to Europe’s GDPR. The deepest challenge may be cultural: while multilingual models increasingly ´understand´ Bangla, experts argue these systems lack the subtlety to capture local nuances and identity. As OpenAI’s models improve their Bangla, Dr Farig Sadeque of BRAC University suggests the focus should shift to ensuring LLMs authentically reflect Bangladesh’s unique linguistic and cultural character rather than trying to replicate a Western-style foundational model from scratch.

67

Impact Score

How the European Union’s digital rules shape innovation beyond its borders

The European Union’s expanding digital rulebook is setting global norms for platforms, data and artificial intelligence, but the model creates uneven impacts on startups and non-EU firms. Predictability and trust increase for some players, while fixed compliance costs and market-access rules weigh more heavily on smaller companies and foreign businesses.

Digital Europe Programme targets strategic technologies and sovereignty

The Digital Europe Programme channels more than €8.1 billion into strategic digital capacities, from supercomputing to semiconductors, to reduce Europe’s dependence on foreign technologies. It complements other European Union instruments to drive digital transformation, skills and industrial competitiveness across the bloc.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.