Building Arabic AI from the ground up

Special Building Arabic AI from the ground up
Short Url
Updated 25 September 2025
Follow

Building Arabic AI from the ground up

Building Arabic AI from the ground up
  • From language depth to data security, regional AI must reflect local values, priorities

ALKHOBAR: When Saudi Arabia unveiled Allam, its homegrown Arabic large language model, it sent a clear signal: the Kingdom is no longer content to simply consume global AI technologies. 

It intends to build its own. For many, this was a moment of pride — a proof that the Arab world can produce tools designed to understand its own languages, cultures, and contexts.

But experts caution that Allam is only the first step in a much longer journey. Success will not be determined by the models alone, but by the invisible foundations that support them: data, infrastructure, governance, and trust.

“You can’t capture the intent, emotion, and cultural depth of Arabic through translation,” said David Barber, director of the UCL Centre for Artificial Intelligence and Distinguished Scientist at UiPath. “You need systems that think in Arabic from the ground up.”




David Barber, director, UCL Centre for Artificial Intelligence; distinguished scientist at UiPath. (Supplied)

Barber highlights a stark reality: only about 15 percent of Arabic text online is clean enough for training a large language model, compared with over 50 percent for English — a huge head start for models like GPT or Claude. Complicating matters further are Arabic’s complex grammar, diverse dialects, and the common mixing of English and Arabic in a single sentence.

“When you train on noisy or shallow data, the system learns shortcuts,” Barber explained. “It can mimic fluency, but it misses the depth, the idioms, the cultural nuances, the rhythm of thought that makes Arabic distinct.”

For Barber, this underscores the importance of Saudi Arabia’s push for locally sourced, high-quality datasets. Without them, any Arabic LLM risks becoming a shallow copy of English-language AI: competent at generic tasks but unable to capture the soul of the language it claims to represent.

Even the best data is ineffective if it cannot be properly organized, secured, and delivered to the model. Seema Alidily, regional director at Denodo, said Gulf enterprises still face major challenges here.

“Without localized infrastructure, AI systems risk misunderstanding user intent or producing irrelevant outputs,” she said. “Data virtualization is one of the few ways to unify governance and access across cloud and on-site systems without moving sensitive information.”




Seema Alidily, regional director, Denodo. (Supplied)

Practically, this means investing in platforms that can pull data from dozens of scattered sources — from ERP systems to IoT sensors— and present it in a unified view for AI to use. In Saudi Arabia, where Vision 2030 projects depend on massive, real-time datasets, this approach is critical, especially given strict regulations on handling citizen data.

Alidily warned that merely replicating Western infrastructure may not suffice. “In the Gulf, centralized visibility and compliance must come first,” she noted. “It is not just a technical issue, it is about aligning with the legal, cultural, and regulatory expectations of the region.”

For Bader AlBahaian, country manager for Saudi Arabia at VAST Data, the stakes go beyond efficiency — they touch on independence and security.

“If we depend exclusively on external platforms, we risk importing their policies and their priorities, often at the expense of regional needs,” he said.




Bader AlBahaian, country manager, Saudi Arabia, VAST Data. (Supplied)

AlBahaian advocates for “sovereign-by-design” systems: storage and compute architectures that keep sensitive data within national borders, encryption and access controls that satisfy local regulators, and AI models trained under rules set by the Kingdom rather than a foreign vendor.

“It is not just about where the data sits,” he added. “It is about who gets to define how it is used, who takes responsibility when something goes wrong, and who has the power to switch the system off if necessary.”

This question of sovereignty is becoming urgent as AI begins to shape decisions in finance, healthcare, education, and public policy. A misaligned model trained on foreign data could issue recommendations that contradict local priorities — or worse, expose the region to economic or political risks.

But building perfect infrastructure is only half the challenge. Success ultimately depends on how AI is deployed.

“Digital labor will allow businesses to have much deeper relationships with their customers,” said Ibrahim Alseghayr, managing director of Salesforce Saudi Arabia. “And by taking on so much of the routine work, AI frees humans to focus on collaboration, creativity, and critical thinking.”




Ibrahim Alseghayr, managing director of Salesforce Saudi Arabia. (Supplied)

Alseghayr points to Agentic AI — systems that can act on a company’s behalf — as already transforming service centers, financial operations, and citizen engagement platforms. In Saudi Arabia, he sees huge potential for digital labor in scaling mega-projects like Neom, automating logistics networks, and delivering smarter healthcare services.

He cautioned that this transformation must be carefully managed. “We need strong governance, testing environments, and continuous oversight,” he said. “Otherwise, we risk building tools we do not fully understand, and that could erode trust instead of building it.”

Across all four experts, one theme is clear: global rules and imported frameworks will not suffice. The Arab world must craft its own AI governance models, rooted in its cultural and legal realities.

For Barber, Allam is a test case. “This is the Kingdom’s chance to prove that it can build systems that are not only technically powerful but also aligned with its values,” he added.

DID YOU KNOW?

• Arabic’s complex grammar, dialect diversity, and frequent English–Arabic mixing make it one of the hardest languages for AI to master. 

• Saudi Arabia’s Allam is the first homegrown Arabic large language model, designed to think in Arabic rather than translate from English. 

• Vision 2030 projects depend on real-time data, but regulations require strict handling of citizen information.

“Agentic AI can create personalized treatment plans, autonomously monitor patients, and detect early signs of health deterioration before a doctor ever enters the room,” he said.Alidily agrees, emphasizing that governance frameworks must reflect the Gulf’s unique data protection requirements, with regulators working closely with technology providers to define shared standards.

AlBahaian is even more direct. “Trust is earned through systems, not slogans. People need to know where their data is, who is using it, and for what purpose. That is the only way to build confidence at scale.”

The message is clear: Arabic AI’s future will not be decided by model size alone. It will depend on investments in infrastructure, sovereignty, and governance.

Saudi Arabia has taken the first step with Allam. What comes next — the data pipelines, virtualized infrastructure, sovereign controls, and digital labor deployments — will determine whether the Kingdom becomes a true AI creator or remains a buyer of foreign-built intelligence.

 


Kuwait leads Gulf non-oil growth as Egypt stabilizes and Qatar slows: S&P Global PMI 

Kuwait leads Gulf non-oil growth as Egypt stabilizes and Qatar slows: S&P Global PMI 
Updated 9 sec ago
Follow

Kuwait leads Gulf non-oil growth as Egypt stabilizes and Qatar slows: S&P Global PMI 

Kuwait leads Gulf non-oil growth as Egypt stabilizes and Qatar slows: S&P Global PMI 

RIYADH: Gulf business conditions diverged in October as Kuwait’s non-oil sector strengthened, Qatar’s non-energy growth slowed, and Egypt’s contraction eased to an eight-month low. 

According to the latest S&P Global Purchasing Managers’ Index surveys, Kuwait’s PMI rose to 52.8, indicating solid growth; Qatar’s PMI slipped to 50.6, pointing to only a marginal upturn; and Egypt’s index increased to 49.2, suggesting a softer decline in business activity. 

In Egypt, the non-oil private sector showed signs of stabilization as declines in output and new orders moderated.  

The PMI rose from 48.8 in September to 49.2 in October, remaining below the 50 threshold that separates growth from contraction but above its long-term trend. 

“The Egypt PMI stayed above its long-term trend in October, pointing to a year-on-year GDP growth rate of about 4.6 percent,” said David Owen, senior economist at S&P Global Market Intelligence.

However, he cautioned that “rising cost pressures could slow things down if companies struggle to absorb these costs.” 

Wage costs climbed at the fastest rate since 2020, lifting input inflation, though firms largely held prices steady to support sales. 

In Kuwait, non-oil firms reported faster increases in output, new orders, and employment, marking the most robust expansion in several months.  

The PMI climbed to 52.8 from 52.2 in September. “The October PMI data for Kuwait help to allay any fears that the recent growth slowdown was going to result in a more prolonged soft patch,” said Andrew Harker, economics director at S&P Global Market Intelligence.

Hiring grew at the fastest pace in four months, but staff shortages contributed to a further accumulation of backlogs.

Companies also faced sharper rises in input and staff costs, yet output prices rose only marginally as firms sought to remain competitive and secure new business.

Meanwhile, Qatar’s non-energy private sector recorded a slowdown, with the headline PMI easing to 50.6 in October from 51.5 in September, the weakest reading since January.

The decline reflected softer output and new order volumes, with construction activity showing notable weakness. 

“Qatar’s non-energy private sector continued to report an overall improvement in business conditions in October,” said Trevor Balchin, economics director at S&P Global Market Intelligence.

That said, he added, the headline PMI eased to a nine-month low of 50.6, signaling only a fractional upturn.

Despite weaker demand, employment increased at one of the fastest rates on record, led by gains in manufacturing.

Firms also reported rising wages and purchase prices but lower overall input costs as competitive pressures weighed on selling prices.