{"id":22589098,"url":"https://github.com/vidhi1290/llm---detect-ai-generated-text","last_synced_at":"2025-10-19T15:03:55.734Z","repository":{"id":205250361,"uuid":"713785080","full_name":"Vidhi1290/LLM---Detect-AI-Generated-Text","owner":"Vidhi1290","description":"AI-Generated Text Detection: A BERT-powered solution for accurately identifying AI-generated text. Seamlessly integrated, highly accurate, and user-friendly.🚀","archived":false,"fork":false,"pushed_at":"2024-07-11T12:14:01.000Z","size":53,"stargazers_count":57,"open_issues_count":0,"forks_count":9,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-04-10T21:45:43.830Z","etag":null,"topics":["ai-generated","bert","bert-model","detection-algorithm","kaggle","kaggle-competition","llm","machine-learning","natural-language-processing","nlp"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Vidhi1290.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-11-03T08:26:39.000Z","updated_at":"2025-03-29T17:39:32.000Z","dependencies_parsed_at":null,"dependency_job_id":"8b660fe0-ebe4-4067-820a-0c16aa6183e4","html_url":"https://github.com/Vidhi1290/LLM---Detect-AI-Generated-Text","commit_stats":null,"previous_names":["vidhi1290/llm---detect-ai-generated-text"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Vidhi1290%2FLLM---Detect-AI-Generated-Text","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Vidhi1290%2FLLM---Detect-AI-Generated-Text/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Vidhi1290%2FLLM---Detect-AI-Generated-Text/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Vidhi1290%2FLLM---Detect-AI-Generated-Text/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Vidhi1290","download_url":"https://codeload.github.com/Vidhi1290/LLM---Detect-AI-Generated-Text/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248305874,"owners_count":21081562,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ai-generated","bert","bert-model","detection-algorithm","kaggle","kaggle-competition","llm","machine-learning","natural-language-processing","nlp"],"created_at":"2024-12-08T08:12:16.491Z","updated_at":"2025-10-19T15:03:50.693Z","avatar_url":"https://github.com/Vidhi1290.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# AI-Generated Text Detection using BERT\n\nWelcome to our AI-Generated Text Detection project! In this repository, we present a robust solution for detecting AI-generated text using BERT, a cutting-edge natural language processing model. Whether you're a researcher, developer, or a curious enthusiast, this project empowers you to explore, understand, and combat AI-generated content effectively.\n\n## Table of Contents\n\n- [Introduction](#introduction)\n- [Features](#features)\n- [Getting Started](#getting-started)\n- [How It Works](#how-it-works)\n- [Contributing](#contributing)\n- [License](#license)\n\n## Introduction\n\nAI-generated content is becoming increasingly sophisticated, making it challenging to distinguish between genuine and computer-generated text. Our project aims to tackle this issue by leveraging the power of BERT (Bidirectional Encoder Representations from Transformers) to identify and flag AI-generated text segments. Whether you're dealing with chatbots, articles, or social media posts, our solution offers accurate detection, ensuring the authenticity of digital content.\n\n## Features\n\n- **BERT-Powered Detection:** We utilize state-of-the-art BERT models to analyze the semantic context and linguistic nuances, enabling precise identification of AI-generated text.\n- **Effortless Integration:** Seamlessly integrate our solution into your existing applications or workflows, ensuring hassle-free implementation for developers and researchers.\n- **High Accuracy:** Our model is meticulously trained and fine-tuned to achieve high accuracy, minimizing false positives and false negatives for reliable results.\n- **User-Friendly Interface:** With intuitive interfaces and clear instructions, users can easily navigate and utilize the detection tool without any technical expertise.\n\n## Getting Started\n\nFollow these simple steps to get started with our AI-Generated Text Detection tool:\n\n1. **Clone the Repository:**\n   ```bash\n   git clone https://github.com/your-username/ai-generated-text-detection.git\n   cd ai-generated-text-detection\n   ```\n   \n- **Access the generated submission.csv file to explore the detected AI-generated text segments and their respective confidence scores.**\n\n## How It Works\n\nOur solution follows a comprehensive approach to AI-generated text detection:\n\n**Data Preprocessing:** We clean and preprocess the textual data, removing noise and irrelevant information to enhance the accuracy of our model.\n\n**BERT Tokenization:** Leveraging the BERT tokenizer, we encode the preprocessed text, preparing it for input into our detection model.\n\n**Model Training:** Using a BERT-based sequence classification model, we train the system to distinguish between genuine and AI-generated text with a high degree of accuracy.\n\n**Predictions:** Once trained, the model generates predictions for test data, highlighting potential AI-generated content segments.\n\n**Result Analysis:** The results are saved in a CSV file, allowing users to review and analyze the detected segments along with their confidence scores.\n\n## Contributing\n\nWe welcome contributions from the community! Whether you're a seasoned developer, a data science enthusiast, or a domain expert, your insights and expertise can enhance our project. \n\n🚀 **Connect With Me:**\n- LinkedIn: [LinkedIn Profile](https://www.linkedin.com/in/vidhi-waghela-434663198/)\n- Kaggle: [Kaggle Profile](https://www.kaggle.com/vidhikishorwaghela)\n- GitHub: [GitHub Profile](https://github.com/Vidhi1290)\n\nIf you find this project interesting or helpful, don't hesitate to follow me for more exciting updates and projects! Let's learn and grow together! 🌟\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fvidhi1290%2Fllm---detect-ai-generated-text","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fvidhi1290%2Fllm---detect-ai-generated-text","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fvidhi1290%2Fllm---detect-ai-generated-text/lists"}