The Government of Valencia and the Valencian Institute for Language and Standardization have developed the "Euskorpus" project, which aims to create a large-scale linguistic corpus for the Basque language and to develop models with intelligent interfaces. Total funding for the project through 2027 is 10.55 million euros, to be distributed among various scientific research and technological development projects. The project, launched in February this year, includes the transcription of 3.3 hours of audiovisual content in Basque, Spanish, English, and French, amounting to 3 million text extracts. It also sets aside 3.5 million euros for tasks. The main beneficiary is the Euskorpora foundation, which is linked to the technology company Vicomtech. The remaining 10.5 million euros will be allocated through competitive grants from 2001, including contributions from public administration and from private companies such as PwC, within the framework of the state budget. PP deputy Santiago López Sepúlveda stated: "This is a historic achievement for the II, towards a better society." He also noted that Vicomtech, through agreements with the PNV, provides guarantees for data protection and is expanding infrastructure in San Sebastián.
The project is part of the strategy for promoting the Basque language in the digital sphere, but its implementation raises questions. The project's authorities emphasize "transparency, need and public interest," relying on the goodwill of public tenders. Participants include universities, energy companies, and private companies, but their specific roles remain unclear.
The key element is the creation of open-source AI models for language support, but their distribution and use are not regulated by public law.