NLP & Portuguese–Chinese Machine Translation Lab (NLP2CT)
Mar 2024 – Present
Research Assistant — University of Macau
Macau SAR, China
- Implemented and evaluated Transformer-based models (BERT, RoBERTa, GPT-style) for multilingual machine translation; developed test plans, triaged model failures, and iterated on configurations across low-resource language pairs.
- Built automated evaluation pipelines (BLEU, ROUGE, WER, BERTScore) into reproducible, containerized workflows using Git and Docker, with experiment tracking via MLflow and Weights & Biases.
- Conceived and led IndicDetect, a cross-lingual benchmark for LLM-generated text detection across Hindi, Telugu, and Tamil; drove dataset construction, evaluation framework design, and analysis of zero-shot and fine-tuned detector performance.
- Co-authored a manuscript submitted to ACL Rolling Review (ARR) 2026, contributing experimental design, writing, and result analysis.