{"id":25531701,"url":"https://github.com/headless-start/pdf-summary-app","last_synced_at":"2026-04-28T12:31:27.906Z","repository":{"id":276802196,"uuid":"929766541","full_name":"headless-start/pdf-summary-app","owner":"headless-start","description":"This repository contains a PDF Summary Web Application hosted on Streamlit Cloud.","archived":false,"fork":false,"pushed_at":"2025-02-10T16:52:23.000Z","size":8,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-08-23T12:39:04.955Z","etag":null,"topics":["deepseek-r1","langchain","llm","llmapi","openai","openai-api","pdf","python3","streamlit","streamlitcloud"],"latest_commit_sha":null,"homepage":"https://pdfdeepv1.streamlit.app/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/headless-start.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-02-09T10:57:07.000Z","updated_at":"2025-02-10T16:52:27.000Z","dependencies_parsed_at":"2025-02-10T14:50:20.126Z","dependency_job_id":"d7158436-adf8-484e-a122-e8d7e2914630","html_url":"https://github.com/headless-start/pdf-summary-app","commit_stats":null,"previous_names":["headless-start/pdf-summary-app"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/headless-start/pdf-summary-app","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/headless-start%2Fpdf-summary-app","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/headless-start%2Fpdf-summary-app/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/headless-start%2Fpdf-summary-app/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/headless-start%2Fpdf-summary-app/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/headless-start","download_url":"https://codeload.github.com/headless-start/pdf-summary-app/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/headless-start%2Fpdf-summary-app/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":279183246,"owners_count":26121357,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-16T02:00:06.019Z","response_time":53,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["deepseek-r1","langchain","llm","llmapi","openai","openai-api","pdf","python3","streamlit","streamlitcloud"],"created_at":"2025-02-20T01:19:34.667Z","updated_at":"2025-10-16T11:22:17.553Z","avatar_url":"https://github.com/headless-start.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# 📄 PDF Summary Tool  \n\n## 📌 Project Overview  \nThis project demonstrates the **loading, preprocessing, and summarization** of text from **PDF files** using **Streamlit** and **LLM APIs like (Deepseek r1,o3-mini)**. The application allows users to upload a PDF, extract its text, and generate a concise summary. The summary can be downloaded as a `.txt` file for offline use.  \n\n---\n\n## 🚀 Key Features  \n1. **PDF Summarization**  \n   - Users can upload a PDF file (up to 200MB) and get the summary of the document.  \n\n---\n\n## 🔍 How It Works  \n1. **Upload a PDF**:  \n   - Users upload a PDF file using the file uploader in the app.  \n2. **Extract Text**:  \n   - The app extracts text from the PDF using the `PyPDF2` library.  \n3. **Generate Summary**:  \n   - The extracted text is sent to the LLM API (Deepseek r1, o3-mini), which generates a summary.  \n4. **Display and Download**:  \n   - The summary is displayed on the app, and users can download it as a `.txt` file.  \n\n**Check Demo Here**:  \n[![Open in Streamlit](https://static.streamlit.io/badges/streamlit_badge_black_white.svg)](https://pdfdeepv1.streamlit.app/)  \n\n---\n\n## 🛠 System Requirements  \n\n### Dependencies  \n- Python 3.8+  \n- Libraries: `streamlit`, `PyPDF2`, `openai`  \n- Hardware: CPU (GPU not required)  \n\n---\n\n## 📄 License  \nThis project is licensed under the MIT License. See the [LICENSE](LICENSE) file for details.  \n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fheadless-start%2Fpdf-summary-app","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fheadless-start%2Fpdf-summary-app","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fheadless-start%2Fpdf-summary-app/lists"}