Few expressed pain and longing with the intensity of Amália Rodrigues, the iconic fado singer who became a symbol of Portuguese cultural identity. Her voice, her language, and her emotion are all part of a legacy that continues to shape Portugal’s artistic and emotional landscape. Drawing inspiration from that deep cultural well, AMALIA (Automatic Multimodal Language Assistant with Artificial Intelligence) is the name chosen for the first Portuguese Large Language Model (LLM) designed from scratch to reflect and preserve the richness of the Portuguese language and identity – with INESC-ID playing a crucial role, particularly in the area of speech processing.

Derived from the Latin word for “fate,” fado conveys a broad spectrum of emotions, from heartbreak and nostalgia to joy and resilience. Similarly, AMALIA is being designed to understand, process, and generate content in European Portuguese, capturing nuances in both language and culture. “This tool will serve a wide range of applications across essential sectors such as education, media, science, cultural heritage, and public administration”, anticipates INESC-ID researcher and Professor at Técnico, Alberto Abad, from Human Language Technologies.

A strategic national investment

Supported under Portugal’s Recovery and Resilience Plan (PRR) and coordinated by the Foundation for Science and Technology (FCT), AMALIA is being developed by a national consortium of top academic and research institutions. This includes Universidade de Lisboa, via Instituto Superior Técnico, Universidade NOVA, the Universidade do Porto, Universidade de Coimbra, Universidade do Minho, and the national laboratories NOVA LINCS, IT, INESC TEC, CISUC/LASI, and ALGORITMI/LASI. Experts from the University of Beira Interior and the University of Évora are also contributing.

Under the coordination of Alberto Abad, INESC-ID’s contribution focuses on multimodal language processing, particularly the integration of spoken language. This means AMALIA will not only be able to interpret text but also receive and process speech and images – giving it “ears” and “eyes,” with the “brain” generating accurate and contextually aware text responses.

Unlike commercial AI models primarily optimized for global markets, AMALIA is trained from the scratch using resources such as Arquivo.pt and is specifically tailored for European Portuguese. It will be open source and designed to operate in closed and secure environments, ensuring data protection and reinforcing national technological sovereignty.

AMALIA will serve as a strategic asset for Portugal – not just as a language model, but as a digital guardian of linguistic and cultural heritage. In an age when companies tend to prioritize broader language variants like Brazilian Portuguese, AMALIA’s focus on the European variant is both a cultural imperative and a technical challenge.

Filling a niche

By September 2025, the consortium aims to release a public version of the model. A first internal version was successfully launched on March 31, 2025, already capable of engaging in contextual conversations and demonstrating knowledge of Portuguese culture and language.

“AMALIA will not replace general-purpose models like ChatGPT”, Alberto Abad underlines. “Instead, it fills a vital niche: delivering specialized, context-sensitive responses in domains where language, culture, and data privacy matter.” Its potential spans education, public service, cultural preservation, and more.

As Fernando Pessoa once said, “My homeland is the Portuguese language (A minha pátria é a língua portuguesa).” With AMALIA, that homeland now has a voice in the digital future. One that speaks, understands, and respects its unique identity.