Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/rylei-m/rrd_2024_file_comparison_tool
A webpage that compares many files with a settable percentage yield. The application will provide highlighted visuals of differences across files.
https://github.com/rylei-m/rrd_2024_file_comparison_tool
comparative-analysis comparison django files python3 rdf
Last synced: 21 days ago
JSON representation
A webpage that compares many files with a settable percentage yield. The application will provide highlighted visuals of differences across files.
- Host: GitHub
- URL: https://github.com/rylei-m/rrd_2024_file_comparison_tool
- Owner: rylei-m
- Created: 2024-07-03T16:19:02.000Z (7 months ago)
- Default Branch: main
- Last Pushed: 2024-08-28T05:57:42.000Z (5 months ago)
- Last Synced: 2024-11-08T19:12:45.980Z (3 months ago)
- Topics: comparative-analysis, comparison, django, files, python3, rdf
- Language: Python
- Homepage:
- Size: 53.7 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# File Comparison tool
Developed for RR DonnelleyJuly 2024
## Summary
This application is designed to compare multiple files to identify and highlight differences effectively. Clients will be able to drag and drop or select as many documents as they want to be compared. Key features include the ability to compare entire documents, individual paragraphs, or sentences, with a focus on highlighting changes and similarities. Users can set a percentage threshold of content similarity to determine the likeness across files and utilize keyword detection for more precise comparisons. The application offers various comparison levels and options providing flexibility for different user needs. Users can select specific options for detailed analysis and highlighting differences.The core comparison functionality is built using Python. The backend is built using Django, ensuring robust server-side functionality, while the frontend is developed with React, offering a responsive and interactive user interface.
In the future, the project aims to integrate with PowerStream, enhancing its functionality and providing more seamless document management and comparison capabilities. This integration will allow clients to have a seamless platform for file saving and comparisons.
## Goals
- Multi-File Type Comparison: Develop the application to support comparison of various file types, including RTF, DOCX, PDF, and TXT.
- Drag-and-Drop Functionality: Implement a user-friendly interface allowing clients to drag and drop multiple documents for comparison.
- Detailed Comparison Levels: Enable comparison at different levels, including entire documents, paragraphs, and individual sentences.
- Content Similarity Threshold: Allow users to set a percentage threshold to determine the likeness of content across files.
- Keyword Detection: Integrate keyword detection to enhance the precision of comparisons.
- Highlighting Differences: Develop robust functionality to highlight differences and similarities within the documents effectively.
- Flexible Comparison Options: Provide various comparison options and levels to cater to different user needs and preferences.
- Responsive Frontend: Use React to create a responsive and interactive frontend, ensuring a seamless user experience.
- Robust Backend: Utilize Django for the backend to ensure reliable and efficient server-side operations.
- Integration with PowerStream: Plan for future integration with PowerStream to provide seamless document management and comparison capabilities.
- Comprehensive Testing: Implement thorough testing to ensure the application handles various file types and comparison scenarios accurately and efficiently.## Priorities
- Multi-File Type Comparison
- Detailed Comparison Levels
- Content Similarity Threshold
- Keyword Detection
- Highlighting Differences
- Flexible Comparison Options## Future Implementations
- Powerstream Integration
- Developed UI Interface
- Comparisons Across File Types
- Checking For Blank Pages