arxiv:2505.19912
Javier Marin
AI-that-works
AI & ML interests
AI implementation consultant and researcher. Solving systematic operational breakdowns.
Recent Activity
updated a dataset 18 days ago
cert-framework/human-confabulation-benchmark
published a dataset 19 days ago
cert-framework/human-confabulation-benchmark
new activity about 1 month ago
cert-framework/cert-hallucination-demo: CERT Hallucination Detection Without Another LLM