Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models Paper • 2407.16470 • Published Jul 23, 2024
Retrieval or Representation? Reassessing Benchmark Gaps in Multilingual and Visually Rich RAG Paper • 2603.04238 • Published 9 days ago
Health AI Developer Foundations (HAI-DEF) Collection Groups models released for use in health AI by Google. Read more about HAI-DEF at http://goo.gle/hai-def • 22 items • Updated 1 day ago • 203
PubMedQA: A Dataset for Biomedical Research Question Answering Paper • 1909.06146 • Published Sep 13, 2019 • 4
🪩 DISCO Collection Document Intelligence Suite for COmparative Evaluations • 8 items • Updated 4 days ago
How Can We Diagnose and Treat Bias in Large Language Models for Clinical Decision-Making? Paper • 2410.16574 • Published Oct 21, 2024
🩺 Counterfactual Patient Variations (CPV) Collection CPV constructs counterfactual patient vignettes from medical QA datasets by varying demographic attributes (gender, ethnicity) • 4 items • Updated 5 days ago
How Can We Diagnose and Treat Bias in Large Language Models for Clinical Decision-Making? Paper • 2410.16574 • Published Oct 21, 2024 • 1