Nateq - (ناطق) is an innovative application that leverages artificial intelligence solutions, specifically designed for Arabic language processing. Tailored for Arabicthon 2023, ناطق combines the power of OpenAI's language models and Google Cloud Vision API to offer advanced text-to-speech capabilities.
- Fast Text-to-Speech Conversion: Utilizes OpenAI's powerful language models to generate high-quality Arabic speech from text.
- Enhanced Audio: Applies audio processing techniques to improve the quality of generated speech.
- Interactive Interface: Provides a user-friendly interface for users to input text, upload images, and record their own pronunciations.
To run Nateq locally, follow these steps:
-
Install the required Python packages:
pip install gradio pydub librosa google-cloud-vision numpy
-
Run the (Nateq) application:
python Nateq.py
(Nateq) offers multiple functionalities through its intuitive interface:
Faster Model: Use the faster model for quick text-to-speech conversion. Better Model: Opt for the better model when quality is a priority. Record!: Record your own pronunciation and receive a thank-you message.
If you'd like to contribute to (Nateq), feel free to open an issue or submit a pull request. Your feedback and enhancements are highly appreciated.