Automating Behavioral Testing in Machine Translation

A.I. Black GuyNovember 16, 2023

0 0 1 minute read

Automating Behavioral Testing in Machine Translation

Behavioral testing in NLP allows fine-grained evaluation of systems by examining their linguistic capabilities through the analysis of input-output behavior. Unfortunately, existing work on behavioral testing in Machine Translation (MT) is currently restricted to largely handcrafted tests covering a limited range of capabilities and languages. To address this limitation, we propose using Large Language Models (LLMs) to generate a diverse set of source sentences tailored to test the behavior of MT models in a range of situations. We can then verify whether the MT model exhibits the expected behavior through matching candidate sets that are also generated using LLMs. Our approach aims to make behavioral testing of MT systems practical while requiring only minimal human effort. In our experiments, we apply our proposed evaluation framework to assess multiple available MT systems, revealing that while in general pass rates follow the trends observable from traditional accuracy-based metrics, our method was able to uncover several important differences and potential bugs that go unnoticed when relying only on accuracy.

Source link

Automating Behavioral Testing in Machine Translation

Related

A.I. Black Guy

Leave a Reply Cancel reply

Project Mugetsu Legendary Orb Guide – Ultimate Reroll Item

WWE SuperCard QR Codes – 2023!

Bloodtide Secret Codes – Bunker, Vault, and Subway

Widgetable APK/iOS + MOD 1.4.030 (Premium) Download

Camp Buddy MOD APK/iOS v2.2.4 (Unlock All Characters)

Related

A.I. Black Guy

New Report Shows That More People Are Now Getting News Content via TikTok

Members of Congress urge revisions to Treasury's ‘unworkable’ digital asset tax rules

Related Articles

11 Artificial Intelligence AI Tools That You Should Checkout (Feb 2023)

Confidence-Building Measures for Artificial Intelligence: Workshop proceedings

Policy Learning with Large World Models: Advancing Multi-Task Reinforcement Learning Efficiency and Performance

The Best Optimization Algorithm for Your Neural Network | by Riccardo Andreoni | Oct, 2023

Leave a Reply Cancel reply