AI Law Compliance Check Tool Reveals Big Tech Weaknesses in Meeting European Union Regulations

JAKARTA – Several prominent artificial intelligence (AI) models have reportedly fallen short of EU regulations in areas such as cybersecurity and discriminatory output. Generative AI models from technology companies such as Meta, OpenAI, and Alibaba have shown shortcomings in several areas critical to compliance with the European AI Law (AI Act), which is expected to take effect in stages over the next two years.

The AI Act has been debated for years, especially after OpenAI launched ChatGPT at the end of 2022, sparking widespread discussion of the potential existential risks of such models. The resulting public concern pushed policymakers to draft stricter regulations for "general-purpose" AI (GPAI), a category that includes generative AI technologies such as ChatGPT.

To test compliance with the regulation, LatticeFlow AI, a Swiss startup, together with partners at ETH Zurich and Bulgaria's INSAIT, developed a new tool for evaluating generative AI models. The tool scores models between 0 and 1 across a range of categories covering technical aspects such as robustness, security, and the risk of discriminatory output.

Test Results and AI Model Shortcomings

LatticeFlow published a leaderboard showing the results for the models tested. Models from big tech companies such as Alibaba, Meta, OpenAI, Anthropic, and Mistral all received average scores above 0.75. However, some models showed flaws in key categories that could put them at risk of violating the AI Act.
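As an illustration, an average score like those on the leaderboard can be understood as a mean over per-category scores, each between 0 and 1. The sketch below is only illustrative; the category names and values are hypothetical, and LatticeFlow's actual aggregation method may differ.

```python
# Illustrative sketch only: a plain mean over per-category scores
# (each between 0 and 1). The category names and values here are
# hypothetical, not LatticeFlow's actual data or method.
def average_score(category_scores: dict[str, float]) -> float:
    """Return the mean of per-category compliance scores."""
    return sum(category_scores.values()) / len(category_scores)

# Hypothetical per-category scores for one model.
scores = {"robustness": 0.80, "security": 0.75, "fairness": 0.70}
print(round(average_score(scores), 2))  # prints 0.75
```

On this reading, a model can clear the 0.75 average overall while still scoring poorly in an individual category such as discriminatory output.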

On discriminatory output, the tool gave OpenAI's "GPT-3.5 Turbo" model a low score of 0.46, while Alibaba Cloud's "Qwen1.5 72B Chat" model scored even lower at 0.37. Discriminatory output reflects human biases around gender, race, and other attributes that can surface when an AI model is asked to generate certain content.

In addition, in the "prompt hijacking" category, a type of cyberattack in which hackers disguise malicious prompts as legitimate ones to extract sensitive information, Meta's "Llama 2 13B Chat" model received a low score of 0.42, while Mistral's "8x7B Instruct" model scored lower still at 0.38.

Claude 3 Opus, a model developed by Google-backed Anthropic, received the highest average score, 0.89, across the categories, making it the strongest model in terms of security compliance and technical robustness.

Potential for Heavy Fines

The checking tool was designed to align with the text of the AI Act and is expected to be updated as additional enforcement measures are introduced. According to LatticeFlow CEO and co-founder Petar Tsankov, the test results show where companies need to focus their efforts to ensure compliance with the AI Act.

He said that although the results were positive overall, there were still gaps that needed to be closed before these generative AI models could meet regulatory standards.

"EU is still perfecting compliance benchmarks, but we can already see some shortcomings in existing AI models," said Tsankov. With a greater focus on optimization for compliance, we believe model providers can prepare well to meet regulatory requirements.

Companies that fail to comply with the AI Act can be fined 35 million euros (approximately 38 million US dollars) or 7% of global annual turnover, whichever is greater. This puts significant pressure on tech companies to fix the shortcomings exposed by these tests.
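The fine ceiling described above is simply the larger of two figures: a flat 35 million euros or 7% of global annual turnover. A minimal sketch of that calculation, using a hypothetical turnover figure:

```python
# Sketch of the AI Act fine cap described above: the greater of a
# flat 35 million euros or 7% of global annual turnover.
def max_fine_eur(global_annual_turnover_eur: float) -> float:
    return max(35_000_000, global_annual_turnover_eur * 7 / 100)

# Hypothetical company with 1 billion euros of annual turnover:
# 7% (70 million euros) exceeds the 35 million euro floor.
print(max_fine_eur(1_000_000_000))  # prints 70000000.0
```

For smaller firms whose 7% figure falls below 35 million euros, the flat amount applies instead.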

The European Union is still working out how the AI Act's rules will be enforced, particularly for generative AI tools such as ChatGPT, and is convening experts to draw up a code of practice expected to be completed by spring 2025.

Although the European Commission cannot verify external tools, it has been kept informed throughout the development of the checking tool and called it an important first step in implementing the AI Act. A Commission spokesperson said, "The Commission welcomes this study and the AI model evaluation platform as a first step in translating the EU AI Act into technical requirements."

Several technology companies whose models were tested, such as Meta and Mistral, declined to comment. Meanwhile, companies such as Alibaba, Anthropic, and OpenAI did not immediately respond to requests for comment regarding the test results.