|
--- |
|
library_name: transformers.js |
|
base_model: |
|
- prithivMLmods/Common-Voice-Gender-Detection |
|
license: apache-2.0 |
|
language: |
|
- en |
|
pipeline_tag: audio-classification |
|
tags: |
|
- Gender-Detection |
|
--- |
|
|
|
# Common-Voice-Gender-Detection (ONNX) |
|
|
|
This is an ONNX version of [prithivMLmods/Common-Voice-Gender-Detection](https://huggingface.co/prithivMLmods/Common-Voice-Gender-Detection). It was automatically converted and uploaded using [this space](https://huggingface.co/spaces/onnx-community/convert-to-onnx). |
|
|
|
|
|
> **Common-Voice-Gender-Detection** is a fine-tuned version of `facebook/wav2vec2-base-960h` for **binary audio classification**, specifically trained to detect speaker gender as **female** or **male**. This model leverages the `Wav2Vec2ForSequenceClassification` architecture for efficient and accurate voice-based gender classification. |
|
|
|
> [!note] |
|
Wav2Vec2: Self-Supervised Learning for Speech Recognition : [https://arxiv.org/pdf/2006.11477](https://arxiv.org/pdf/2006.11477) |
|
|
|
--- |
|
|
|
## Intended Use |
|
|
|
`Common-Voice-Gender-Detection` is designed for: |
|
|
|
* **Speech Analytics** β Assist in analyzing speaker demographics in call centers or customer service recordings. |
|
* **Conversational AI Personalization** β Adjust tone or dialogue based on gender detection for more personalized voice assistants. |
|
* **Voice Dataset Curation** β Automatically tag or filter voice datasets by speaker gender for better dataset management. |
|
* **Research Applications** β Enable linguistic and acoustic research involving gender-specific speech patterns. |
|
* **Multimedia Content Tagging** β Automate metadata generation for gender identification in podcasts, interviews, or video content. |