https://github.com/muhammadshavaiz/searchenginejava
https://github.com/muhammadshavaiz/searchenginejava
Last synced: 2 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/muhammadshavaiz/searchenginejava
- Owner: MuhammadShavaiz
- Created: 2024-05-21T16:10:16.000Z (12 months ago)
- Default Branch: master
- Last Pushed: 2024-05-21T18:41:47.000Z (12 months ago)
- Last Synced: 2025-01-11T03:44:53.055Z (4 months ago)
- Language: Java
- Size: 589 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# CNN News Search Engine
This project is a search engine developed in Java using object-oriented programming principles. The search engine is designed to index and retrieve CNN news articles based on user queries, providing precise and relevant search results.
## Features
- **Data Preprocessing:** Utilizes techniques such as tokenization, stop word removal, and stemming to clean and normalize text data.
- **Indexing:** Implements both forward and backward indexing mechanisms for efficient search operations.
- **Lexicon Management:** Builds lexicons to aid in filtering and indexing relevant words.
- **User Interface:** Provides a simple command-line interface for users to input queries and receive search results.
- **Error Handling:** Incorporates robust error handling mechanisms to ensure smooth user experience.## Installation
To run the search engine project, follow these steps:
1. Clone the repository to your local machine:
```
git clone
```2. Navigate to the project directory:
```
cd search-engine-project
```3. Compile the Java files:
```
javac *.java
```4. Run the main class to start the search engine:
```
java SearchEngineMain
```## Usage
Once the search engine is running, follow the on-screen instructions to perform searches and view the results. Input your query and press Enter to see the relevant news articles.
## Dependencies
- Java Development Kit (JDK)
- Apache Lucene
- Stanford NLP
- JUnit (for testing)## Contributors
- [Azan Siddiq](https://github.com/azaans23)
- [Taha Mukhtiar Janjua](https://github.com/tahamukhtiar)
- [Abdullah Aleem Gujjar](https://github.com/abdullahaleemgujjar)## License
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
## Acknowledgments
- Special thanks to [NoOne.org](https://noone.org) for their support and inspiration.
---
Feel free to contribute with additional information.