{"id":24583762,"url":"https://github.com/assemblyai-solutions/bulk-csv-lemur-processor-streamlit","last_synced_at":"2025-03-17T17:25:11.306Z","repository":{"id":241811285,"uuid":"807245942","full_name":"AssemblyAI-Solutions/bulk-csv-lemur-processor-streamlit","owner":"AssemblyAI-Solutions","description":null,"archived":false,"fork":false,"pushed_at":"2025-01-22T21:11:21.000Z","size":13,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":3,"default_branch":"main","last_synced_at":"2025-01-22T22:19:03.370Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/AssemblyAI-Solutions.png","metadata":{"files":{"readme":"readme.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-05-28T18:22:33.000Z","updated_at":"2025-01-22T21:11:25.000Z","dependencies_parsed_at":"2024-05-30T06:01:55.946Z","dependency_job_id":"b70f201a-9ece-4b1e-a94d-a156d684a982","html_url":"https://github.com/AssemblyAI-Solutions/bulk-csv-lemur-processor-streamlit","commit_stats":null,"previous_names":["assemblyai-solutions/bulk-csv-lemur-processor-streamlit"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AssemblyAI-Solutions%2Fbulk-csv-lemur-processor-streamlit","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AssemblyAI-Solutions%2Fbulk-csv-lemur-processor-streamlit/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AssemblyAI-Solutions%2Fbulk-csv-lemur-processor-streamlit/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AssemblyAI-Solutions%2Fbulk-csv-lemur-processor-streamlit/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/AssemblyAI-Solutions","download_url":"https://codeload.github.com/AssemblyAI-Solutions/bulk-csv-lemur-processor-streamlit/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":244076047,"owners_count":20394047,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-01-24T04:53:40.569Z","updated_at":"2025-03-17T17:25:11.282Z","avatar_url":"https://github.com/AssemblyAI-Solutions.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# AssemblyAI LeMUR CSV Processor\n\nThis repository contains a Streamlit application that processes a CSV file using AssemblyAI's LeMUR (Language Model Understanding and Reasoning) API. The application allows you to input your AssemblyAI API key, upload a CSV file, provide a custom prompt, and then process the CSV file using LeMUR. The processed results are then available for download as a new CSV file.\n\n## Features\n\n- Input your AssemblyAI API key for authentication\n- Upload a CSV file containing transcript IDs\n- Provide a custom prompt for LeMUR processing\n- Process the CSV file using LeMUR API\n- Download the processed results as a new CSV file\n\n## Requirements\n\n- Python 3.6 or higher\n- Streamlit\n- AssemblyAI Python SDK\n\n## Installation\n\n1. Clone the repository:\n\n```\ngit clone https://github.com/your-username/assemblyai-lemur-csv-processor.git\n```\n\n2. Change into the project directory:\n\n```\ncd assemblyai-lemur-csv-processor\n```\n\n3. Install the required dependencies:\n\n```\npip install -r requirements.txt\n```\n\n## Usage\n\n1. Run the Streamlit application:\n\n```\nstreamlit run app.py\n```\n\n2. Open the application in your web browser using the provided URL.\n\n3. Enter your AssemblyAI API key in the designated input field.\n\n4. Upload a CSV file containing the transcript IDs you want to process. The CSV file should have a column named \"transcriptid\".\n\n5. Provide a custom prompt for LeMUR processing in the text area. The prompt will be used to guide the LeMUR model in analyzing the transcripts.\n\n6. Click the \"Process CSV\" button to start processing the CSV file using LeMUR.\n\n7. Once the processing is complete, a success message will be displayed, and a \"Download CSV File\" link will appear.\n\n8. Click on the \"Download CSV File\" link to download the processed results as a new CSV file. The downloaded file will contain the original columns from the input CSV, along with two new columns: \"lemur_response\" (containing the LeMUR response) and \"number_occurred\" (indicating the number of \"yes\" answers in the LeMUR response).\n\n## Customization\n\nYou can customize the application by modifying the `app.py` file. The main components of the application are:\n\n- `parse_json_from_resp`: Extracts the JSON array from the LeMUR response string.\n- `count_yes`: Counts the number of \"yes\" answers in the LeMUR response.\n- `process_row`: Processes a single row of the CSV file using LeMUR.\n- `process_csv`: Processes the entire CSV file using multithreading.\n- `download_csv`: Generates the download link for the processed CSV file.\n\nFeel free to modify these functions to suit your specific requirements.\n\n## Acknowledgements\n\n- [AssemblyAI](https://www.assemblyai.com/) for providing the LeMUR API\n- [Streamlit](https://streamlit.io/) for the web application framework\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fassemblyai-solutions%2Fbulk-csv-lemur-processor-streamlit","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fassemblyai-solutions%2Fbulk-csv-lemur-processor-streamlit","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fassemblyai-solutions%2Fbulk-csv-lemur-processor-streamlit/lists"}