oumi-ai/HallOumi-8B-classifier

@@ -2,7 +2,6 @@
 library_name: transformers
 license: cc-by-nc-4.0
 datasets:
-- oumi-ai/oumi-anli-subset
 - oumi-ai/oumi-c2d-d2c-subset
 - oumi-ai/oumi-synthetic-claims
 - oumi-ai/oumi-synthetic-document-claims
@@ -23,16 +22,16 @@ base_model:
 <!-- Provide a quick summary of what the model is/does. -->
-Introducing **HallOumi-8B-classifier**, a **SOTA hallucination detection model**, outperforming DeepSeek R1, OpenAI o1, Google Gemini 1.5 Pro, and Anthropic Sonnet 3.5 at only **8 billion parameters!**
-Give HallOumi a try now!
-* Demo: https://oumi.ai/halloumi-demo
-* Github: https://github.com/oumi-ai/oumi/tree/main/configs/projects/halloumi
 | Model                 | Balanced Accuracy | Macro F1 Score | Open Source? | Model Size |
 | --------------------- | ----------------- | --------------------------------------- | ------------ | ---------- |
-| **HallOumi-8B**       | **73.0% ± 2.2%**  | **75.1% ± 2.2%**                        | ✔️           | 8B         |
 | Anthropic Sonnet 3.5  | 67.3% ± 2.7%      | 69.6% ± 2.8%                            | ❌            | ??         |
 | OpenAI o1-preview     | 64.5% ± 2.0%      | 65.9% ± 2.3%                            | ❌            | ??         |
 | DeepSeek R1           | 60.7% ± 2.1%      | 61.6% ± 2.5%                            | ✔️           | 671B       |
@@ -47,7 +46,7 @@ For example, when given one or more context documents, as well as an AI-generate
 * A determination whether that particular statement is **supported or unsupported** by the provided context.
 * An **explanation** describing why a particular claim is supported or unsupported.
-**HallOumi-8B-classifier** is trained with similar data to HallOumi-8B but is instead trained as a classifier rather than a generative model.
 * ✔️ Fast
 * ✔️ Per-claim support (must call once per claim)
 * ❌ No Explanations
@@ -79,7 +78,7 @@ however, this is not enough, as we have to be capable of doing these things in a
 - **Language(s) (NLP):** English
 - **License:** [CC-BY-NC-4.0](https://creativecommons.org/licenses/by-nc/4.0/deed.en)
 - **Finetuned from model:** [Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct)
-- **Demo:** [HallOumi Demo](https://oumi.ai/halloumi)
 ---
@@ -88,7 +87,7 @@ however, this is not enough, as we have to be capable of doing these things in a
 <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
 Use to verify claims/detect hallucinations in scenarios where a known source of truth is available.
-Demo: https://oumi.ai/halloumi
 ## Out-of-Scope Use
@@ -125,11 +124,11 @@ Eval notebook: Coming Soon
 <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
-- **Hardware Type:** H100
-- **Hours used:** 32 (4 * 8 GPUs)
 - **Cloud Provider:** Google Cloud Platform
 - **Compute Region:** us-east5
-- **Carbon Emitted:** 2.8 kg
 ## Citation
@@ -137,11 +136,11 @@ Eval notebook: Coming Soon
 ```
 @misc{oumiHalloumi8BClassifier,
-  author = {Jeremiah Greer},
   title = {HallOumi-8B-classifier},
   month = {March},
   year = {2025},
-  url = {https://huggingface.co/oumi-ai/HallOumi-8B}
 }
 @software{oumi2025,

 library_name: transformers
 license: cc-by-nc-4.0
 datasets:
 - oumi-ai/oumi-c2d-d2c-subset
 - oumi-ai/oumi-synthetic-claims
 - oumi-ai/oumi-synthetic-document-claims
 <!-- Provide a quick summary of what the model is/does. -->
+Introducing **HallOumi-8B-classifier**, a _fast_ **SOTA hallucination detection model**, outperforming DeepSeek R1, OpenAI o1, Google Gemini 1.5 Pro, and Anthropic Sonnet 3.5 at only 8 billion parameters!
+<!-- Give HallOumi a try now! -->
+<!-- * Demo: https://oumi.ai/halloumi-demo -->
+<!-- * Github: https://github.com/oumi-ai/oumi/tree/main/configs/projects/halloumi -->
 | Model                 | Balanced Accuracy | Macro F1 Score | Open Source? | Model Size |
 | --------------------- | ----------------- | --------------------------------------- | ------------ | ---------- |
+| **HallOumi-8B-classifier**       | **76.8% ± 2.0%**  | **78.5% ± 2.1%**                        | ✔️           | 8B         |
 | Anthropic Sonnet 3.5  | 67.3% ± 2.7%      | 69.6% ± 2.8%                            | ❌            | ??         |
 | OpenAI o1-preview     | 64.5% ± 2.0%      | 65.9% ± 2.3%                            | ❌            | ??         |
 | DeepSeek R1           | 60.7% ± 2.1%      | 61.6% ± 2.5%                            | ✔️           | 671B       |
 * A determination whether that particular statement is **supported or unsupported** by the provided context.
 * An **explanation** describing why a particular claim is supported or unsupported.
+**HallOumi-8B-classifier**, the hallucination classification model built with Oumi, is an end-to-end classification system that enables *fast and accurate* assessment of the hallucination probability of any written content (AI or human-generated).
 * ✔️ Fast
 * ✔️ Per-claim support (must call once per claim)
 * ❌ No Explanations
 - **Language(s) (NLP):** English
 - **License:** [CC-BY-NC-4.0](https://creativecommons.org/licenses/by-nc/4.0/deed.en)
 - **Finetuned from model:** [Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct)
+<!-- - **Demo:** [HallOumi Demo](https://oumi.ai/halloumi) -->
 ---
 <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
 Use to verify claims/detect hallucinations in scenarios where a known source of truth is available.
+<!-- Demo: https://oumi.ai/halloumi -->
 ## Out-of-Scope Use
 <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
+- **Hardware Type:** A100-80GB
+- **Hours used:** 1.5 (4 * 8 GPUs)
 - **Cloud Provider:** Google Cloud Platform
 - **Compute Region:** us-east5
+- **Carbon Emitted:** 0.15 kg
 ## Citation
 ```
 @misc{oumiHalloumi8BClassifier,
+  author = {Achlioptas Panos, Jeremiah Greer, Aisopos Kostas, Schuler A. Michael, Elachqar Oussama, Koukoumidis Emmanouil},
   title = {HallOumi-8B-classifier},
   month = {March},
   year = {2025},
+  url = {https://huggingface.co/oumi-ai/HallOumi-8B-classifier}
 }
 @software{oumi2025,
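
Reviewer note: the benchmark row changed in this diff reports Balanced Accuracy and Macro F1 Score (76.8% ± 2.0% and 78.5% ± 2.1% for HallOumi-8B-classifier). As a reminder of what those two columns usually mean, here is a minimal sketch of the standard textbook definitions. The model card does not show its evaluation code, so this is an assumption about how such numbers are typically computed, not the authors' script. The function names and the toy labels (1 = unsupported, 0 = supported) are hypothetical and are not HallOumi evaluation data.

```python
from typing import Sequence, Tuple


def per_class_counts(
    y_true: Sequence[int], y_pred: Sequence[int], label: int
) -> Tuple[int, int, int]:
    """Return (true positives, false positives, false negatives) for one label."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != label and p == label)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == label and p != label)
    return tp, fp, fn


def balanced_accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Unweighted mean of per-class recall over the classes present in y_true."""
    recalls = []
    for label in sorted(set(y_true)):
        tp, _, fn = per_class_counts(y_true, y_pred, label)
        recalls.append(tp / (tp + fn))  # tp + fn > 0 because label occurs in y_true
    return sum(recalls) / len(recalls)


def macro_f1(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Unweighted mean of per-class F1 over all labels seen in y_true or y_pred."""
    f1_scores = []
    for label in sorted(set(y_true) | set(y_pred)):
        tp, fp, fn = per_class_counts(y_true, y_pred, label)
        denom = 2 * tp + fp + fn
        # Convention assumed here: F1 is 0 when a class has no hits at all.
        f1_scores.append(2 * tp / denom if denom else 0.0)
    return sum(f1_scores) / len(f1_scores)


# Toy example only: 4 unsupported claims (1) and 6 supported claims (0).
toy_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
toy_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1, 1]

if __name__ == "__main__":
    print(f"balanced accuracy: {balanced_accuracy(toy_true, toy_pred):.4f}")
    print(f"macro F1:          {macro_f1(toy_true, toy_pred):.4f}")
```

Both metrics weight each class equally, which is why they are a common choice when supported and unsupported claims are imbalanced in the evaluation set.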