Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/tritonix711/ai-content-verifier
AI Content Verifier is a tool that finds out if text is written by AI or humans. It uses machine learning and natural language processing to give clear results and confidence scores. With an easy-to-use interface, it helps everyone from researchers to content creators check if the content is real or not.
https://github.com/tritonix711/ai-content-verifier
git machine-learning nlp nltk numpy pandas python scikit-learn spacy tkinter
Last synced: about 14 hours ago
JSON representation
AI Content Verifier is a tool that finds out if text is written by AI or humans. It uses machine learning and natural language processing to give clear results and confidence scores. With an easy-to-use interface, it helps everyone from researchers to content creators check if the content is real or not.
- Host: GitHub
- URL: https://github.com/tritonix711/ai-content-verifier
- Owner: Tritonix711
- License: mit
- Created: 2024-09-14T14:05:53.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2024-09-28T15:06:18.000Z (3 months ago)
- Last Synced: 2025-01-07T18:20:55.774Z (2 days ago)
- Topics: git, machine-learning, nlp, nltk, numpy, pandas, python, scikit-learn, spacy, tkinter
- Language: Python
- Homepage:
- Size: 26.4 KB
- Stars: 2
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
![Screenshot 2024-09-28 165852](https://github.com/user-attachments/assets/cc59d92e-d123-4221-8df4-68fcd5fa09ae)
# π AI Content Verifier
## π Project Overview
**AI Content Verifier** is a strong tool that helps tell apart text written by humans and text created by AI. As AI gets better at writing like humans, itβs important to check if the text is real. This tool is helpful for things like checking content, academic honesty, and keeping information accurate.The project uses smart machine learning methods and natural language processing to look at written data, giving users clear results and confidence scores. The easy-to-use graphical interface makes it simple for researchers, teachers, and content creators. With adjustable settings and detailed logs, users can improve detection and keep track of their results easily.
## β Key Features
- **AI Detection**: Accurately classifies text as AI-generated or human-written, helping users verify content authenticity.
- **Confidence Scoring**: Provides confidence scores for each classification to assess the reliability of results.
- **User-Friendly GUI**: An intuitive graphical user interface simplifies content analysis, making it easy for users without technical expertise.
- **Customizable Settings**: Allows users to adjust parameters and configurations to enhance detection performance based on their specific needs.
- **Robust Logging**: Comprehensive logs of predictions, training processes, and user interactions to facilitate better analysis.
- **Cross-Platform Compatibility**: Runs on Windows, macOS, and Linux, ensuring accessibility to a wide range of users.
- **Efficient Data Handling**: Utilizes CSV files for dataset management, simplifying data import, export, and manipulation.
- **Scalable Model**: Designed to accommodate additional training data and improvements, ensuring the verifier evolves with AI advancements.## π οΈ Technologies Used
- **Programming Language**: Python
- **Machine Learning Libraries**:
- scikit-learn: For implementing machine learning algorithms and preprocessing.
- pandas: For data manipulation and analysis.
- numpy: For numerical computations.
- **Natural Language Processing**:
- NLTK: For text processing and linguistic analysis.
- SpaCy: For advanced NLP tasks and efficient text processing.
- **GUI Framework**:
- Tkinter: For creating the graphical user interface.
- PyQt (optional): An alternative GUI framework.
- **Data Handling**: CSV format for dataset management and storage.
- **Version Control**: Git for source code management.## π Project Structure
Here's the structure of the project directory:```plaintext
AI-Content-Verifier/
βββ .vscode/ # VS Code configuration files
βββ config/ # Configuration files
β βββ config.yaml # Configuration settings
βββ data/ # Data directory
β βββ processed/ # Processed datasets for training
β βββ raw/ # Raw datasets
βββ logs/ # Log files
β βββ training.log # Log of model training processes
βββ src/ # Source code directory
β βββ data/ # Data processing scripts
β β βββ preprocess_data.py # Script for data preprocessing
β βββ modeling/ # Model training and evaluation scripts
β β βββ evaluate_model.py # Script to evaluate the model
β β βββ predict.py # Script for making predictions
β β βββ train_model.py # Script for training the model
β βββ saved_models/ # Directory for saved models
β β βββ best_model.pkl # Best trained model for predictions
β β βββ model_logistic_regression.pkl # Logistic regression model
β β βββ tfidf_vectorizer.pkl # TF-IDF Vectorizer for feature extraction
β βββ utils/ # Utility scripts
β βββ helpers.py # Utility functions and helpers
βββ .gitignore # Git ignore file
βββ content_verifier_model.py # Main logic for content verification
βββ gui.py # GUI interface for the project
βββ LICENSE # License information
βββ predictions_log.txt # Log of predictions made by the model
βββ README.md # Project documentation (this file)
βββ requirements.txt # Required Python packages for the project
βββ test_helpers.py # Unit tests for helper functions
```## π¦ Installation Guide
1. **Clone the Repository**:
Open your terminal or command prompt and run the following command to clone the repository:
```bash
git clone https://github.com/your_username/AI-Content-Verifier.git
cd AI-Content-Verifier
```2. **Install Required Packages**:
Use `pip` to install the necessary dependencies from the `requirements.txt` file:
```bash
pip install -r requirements.txt
```3. **Configure the Application**:
Modify the `config/config.yaml` file to set your preferences for model parameters and other settings.4. **Set Up Data Directories**:
Ensure the raw and processed datasets are placed in the following directories:
- Raw datasets: `data/raw/`
- Processed datasets: `data/processed/`5. **Run the Application**:
To run the GUI application, use the following command:
```bash
python gui.py
```## π Data Model Training
The **AI Content Verifier** uses raw and processed datasets for training. Follow these steps to prepare and train your model:1. **Prepare Datasets**:
- Place the raw and processed datasets in the appropriate directories:
- Raw datasets: `data/raw/`
- Processed datasets: `data/processed/`2. **Train the Model**:
To start training, run the following command:
```bash
python src/modeling/train_model.py
```3. **Monitor Training**:
Logs will be generated in the `logs/` directory to help monitor the training process and performance metrics.4. **Save the Trained Model**:
After training, the model will be saved in the `src/saved_models/` directory for future predictions.Ensure the configurations in `config/config.yaml` are set properly to optimize training performance.
## π€ Contributing
We welcome contributions to improve the **AI Content Verifier**! Here's how you can contribute:1. **Fork the Repository**:
- Click the "Fork" button at the top right of the project page on GitHub.2. **Clone Your Fork**:
```bash
git clone https://github.com/your-username/AI-Content-Verifier.git
cd AI-Content-Verifier
```3. **Create a New Branch**:
- Create a branch for your feature or bug fix:
```bash
git checkout -b feature-or-bugfix-name
```4. **Make Your Changes**:
- Implement your changes.5. **Commit Your Changes**:
```bash
git add .
git commit -m "Description of changes"
```6. **Push to Your Branch**:
```bash
git push origin feature-or-bugfix-name
```7. **Submit a Pull Request**:
- Go to the original repository on GitHub and submit a pull request with a clear description.**Guidelines**:
- Follow the existing code style.
- Write clear commit messages.
- Test your changes before submitting a pull request.Thank you for contributing!
## π License
This project is licensed under the MIT License. See the [LICENSE](LICENSE) file for more details.## π§ Contact
For questions or support, please reach out via the contact methods on my GitHub profile. The email address in the GUI (`[email protected]`) is not real and is used for demonstration purposes only, so please don't send emails to that address.