Study: The Kingdom leads countries developing Arabic language models in 2025


A recent study confirmed that the Kingdom tops the list of countries developing Arabic language models, enhancing its ability to compete globally, supporting the presence of the Arabic language in the digital environment, and accelerating the adoption of innovation in institutions.
The study was conducted by the Saudi Data and Artificial Intelligence Authority (SDAIA) in cooperation with the King Salman International Academy for the Arabic Language. It aims to support the development of the Arabic-language artificial intelligence ecosystem and to determine the requirements for developing models that are more capable of understanding Arabic and its various dialects, generating content, and following instructions.

Rules before the year 2000

The study traced the history of Arabic language models from their beginnings in rule-based systems before the year 2000, through statistical models and neural networks, to the current stage of large language models and their generative applications.
The period from 2022 to 2025 saw the launch of dozens of Arabic models, including conversational and generative models aimed at meeting Arab needs in the technical, educational, and knowledge fields.
The study tracked more than 53 Arabic language models through the first quarter of 2025. The Kingdom of Saudi Arabia topped the list of countries developing these models, while international organizations showed notable interest in developing language models that support the Arabic language.
The analysis showed weak investment in Arabic language models that support audio and visual formats, despite their future importance: 81% of these models were unimodal and handled text only, while multimodal models accounted for 7%.

Cognitive and inferential capabilities of programs

As for capabilities, the study showed that Arabic language models cover 3 main tasks: language understanding, content generation, and conversation and instruction following. Their knowledge and reasoning capabilities, multilingualism, and programming support remain at a low level compared with international language models.
The results of the Balsam benchmark, issued by the King Salman International Academy for the Arabic Language, which compares the performance of Arabic language models with their international counterparts on Arabic language tasks, showed that international models lead in the majority of language skill categories. At the same time, the results reflected promising strengths of some Arabic models in specific tasks: they performed slightly better in summarization and delivered similar performance in creative writing and reading comprehension.
The study reviewed the current status of Arabic models, noting the existence of models developed in Arab countries, most notably the Kingdom of Saudi Arabia and the Emirates, in addition to models developed by international organizations that support the Arabic language.
It also identified gaps, most notably the limited size and parameter counts of the models compared with international ones, the lack of comprehensive Arabic data, and the scarcity of Arabic benchmarks specialized in evaluating performance.

Providing high-quality Arabic data

The study laid out a roadmap of practical steps to achieve leadership in large Arabic language models: providing high-quality Arabic data that covers the various dialects and fields, developing language models of various capabilities and sizes, and building Arabic benchmarks to evaluate model quality.
The roadmap also calls for supporting the adoption of Arabic models locally by government and private institutions and making them available for community use.

Empowering digital Arabic content

This study comes within the framework of cooperation between SDAIA and the King Salman International Academy for the Arabic Language. It is a significant step that reflects the Kingdom's interest in combining linguistic and cultural identity with technical development and in ensuring the presence of the Arabic language in the global artificial intelligence ecosystem. The cooperation strengthens the Kingdom's position as a leading regional center for developing Arabic language technologies and enabling digital Arabic content.
