-
Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-OASIS
Paper • 2411.19655 • Published • 20 -
Babelscape/LLM-Oasis_claim_extraction
Viewer • Updated • 81.3k • 40 • 6 -
Babelscape/LLM-Oasis_claim_verification
Viewer • Updated • 2.66k • 49 • 5 -
Babelscape/LLM-Oasis_unfactual_text_generation
Viewer • Updated • 81.2k • 26 • 6
AI & ML interests
Babelscape is a deep tech company founded in 2016 focused on multilingual Natural Language Processing.
Recent Activity
View all activity
-
Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-OASIS
Paper • 2411.19655 • Published • 20 -
Babelscape/LLM-Oasis_claim_extraction
Viewer • Updated • 81.3k • 40 • 6 -
Babelscape/LLM-Oasis_claim_verification
Viewer • Updated • 2.66k • 49 • 5 -
Babelscape/LLM-Oasis_unfactual_text_generation
Viewer • Updated • 81.2k • 26 • 6
Word Sense Linking is the task designed to identify and disambiguate spans of text to their most suitable senses from a reference inventory.
models 18
Babelscape/Qwen2.5-Math-PRM-7B-PDDL-r
7B • Updated • 12
Babelscape/Qwen2.5-Math-7B-PRM800k-PDDL-r
8B • Updated • 22
Babelscape/Llama-3.1-8B-PRM800k-r
8B • Updated • 9
Babelscape/Llama-3.1-8B-PRM800k-PDDL-r
8B • Updated • 9
Babelscape/Qwen2.5-Math-7B-PRM800k-r
8B • Updated • 9
Babelscape/t5-base-summarization-claim-extractor
0.2B • Updated • 2.04k • 14
Babelscape/wsl-reader-deberta-v3-base
0.2B • Updated • 111 • 4
Babelscape/wsl-retriever-e5-base-v2
Updated • 81 • 3
Babelscape/wsl-retriever-e5-base-v2-wordnet-index
Updated • 42 • 5
Babelscape/wsl-base
Updated • 48 • 3
datasets 16
Babelscape/PDDL2PRM
Updated • 48
Babelscape/wsl
Viewer • Updated • 1.31k • 11 • 7
Babelscape/LLM-Oasis_claim_falsification
Viewer • Updated • 52.4k • 22 • 6
Babelscape/LLM-Oasis_unfactual_text_generation
Viewer • Updated • 81.2k • 26 • 6
Babelscape/LLM-Oasis_paraphrase_generation
Viewer • Updated • 81.3k • 17 • 6
Babelscape/LLM-Oasis_claim_verification
Viewer • Updated • 2.66k • 49 • 5
Babelscape/LLM-Oasis_claim_extraction
Viewer • Updated • 81.3k • 40 • 6
Babelscape/story-summeval
Viewer • Updated • 319 • 19 • 8
Babelscape/ALERT_DPO
Viewer • Updated • 45.7k • 79 • 14
Babelscape/ALERT
Viewer • Updated • 45.7k • 1.3k • 16