https://github.com/muhammadshavaiz/searchenginejava



# CNN News Search Engine

This project is a search engine written in Java using object-oriented programming principles. It indexes CNN news articles and retrieves the articles that are relevant to a user's query.

## Features

- **Data Preprocessing:** Utilizes techniques such as tokenization, stop word removal, and stemming to clean and normalize text data.
- **Indexing:** Implements both forward and inverted (backward) indexes for efficient search operations.
- **Lexicon Management:** Builds lexicons to aid in filtering and indexing relevant words.
- **User Interface:** Provides a simple command-line interface for users to input queries and receive search results.
- **Error Handling:** Handles invalid input and runtime errors to ensure a smooth user experience.
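
The preprocessing and indexing features above can be illustrated with a minimal sketch. It is not taken from the repository: the class name `MiniIndexSketch`, its method names, the tiny stop word list, and the naive suffix stripping (a stand-in for a real stemmer) are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical sketch; class and method names are not taken from the repository.
class MiniIndexSketch {
    // Tiny illustrative stop word list; a real lexicon would be much larger.
    static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("the", "a", "an", "and", "are", "is", "in", "of", "to"));

    // Naive suffix stripping, used here only as a stand-in for a real stemmer.
    static String stem(String token) {
        for (String suffix : new String[] {"ing", "es", "s"}) {
            if (token.length() > suffix.length() + 2 && token.endsWith(suffix)) {
                return token.substring(0, token.length() - suffix.length());
            }
        }
        return token;
    }

    // Lowercase, split on non-alphanumeric characters, drop stop words, then stem.
    static List<String> preprocess(String text) {
        List<String> terms = new ArrayList<>();
        for (String token : text.toLowerCase().split("[^a-z0-9]+")) {
            if (!token.isEmpty() && !STOP_WORDS.contains(token)) {
                terms.add(stem(token));
            }
        }
        return terms;
    }

    // Forward index: document id -> preprocessed terms of that document.
    static Map<Integer, List<String>> buildForwardIndex(List<String> documents) {
        Map<Integer, List<String>> forward = new LinkedHashMap<>();
        for (int id = 0; id < documents.size(); id++) {
            forward.put(id, preprocess(documents.get(id)));
        }
        return forward;
    }

    // Inverted index: term -> sorted ids of the documents that contain it.
    static Map<String, Set<Integer>> buildInvertedIndex(Map<Integer, List<String>> forward) {
        Map<String, Set<Integer>> inverted = new TreeMap<>();
        for (Map.Entry<Integer, List<String>> entry : forward.entrySet()) {
            for (String term : entry.getValue()) {
                inverted.computeIfAbsent(term, k -> new TreeSet<>()).add(entry.getKey());
            }
        }
        return inverted;
    }

    // AND query: ids of the documents that contain every preprocessed query term.
    static Set<Integer> search(Map<String, Set<Integer>> inverted, String query) {
        Set<Integer> result = null;
        for (String term : preprocess(query)) {
            Set<Integer> postings = inverted.getOrDefault(term, new TreeSet<>());
            if (result == null) {
                result = new TreeSet<>(postings);
            } else {
                result.retainAll(postings);
            }
        }
        return result == null ? new TreeSet<>() : result;
    }
}
```

In this sketch the forward index is built first and the inverted index is derived from it, so a query only has to intersect the posting sets of its terms instead of scanning every document.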

## Installation

To run the search engine project, follow these steps:

1. Clone the repository to your local machine:

```
git clone https://github.com/muhammadshavaiz/searchenginejava.git search-engine-project
```

2. Navigate to the project directory:

```
cd search-engine-project
```

3. Compile the Java files:

```
javac *.java
```

4. Run the main class to start the search engine:

```
java SearchEngineMain
```

## Usage

Once the search engine is running, follow the on-screen instructions to perform searches and view the results. Input your query and press Enter to see the relevant news articles.
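
The query loop described above might look roughly like the following sketch. It does not reproduce the repository's `SearchEngineMain`: the class `QueryLoopSketch`, the `run` method, the `exit` command, and the prompt and message texts are hypothetical, and the search itself is passed in as a function so the loop stays self-contained.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.PrintStream;
import java.io.UncheckedIOException;
import java.util.List;
import java.util.function.Function;

// Hypothetical sketch of a command-line query loop; not the repository's main class.
class QueryLoopSketch {
    // Reads one query per line until end of input or "exit".
    // Returns the number of non-empty queries that were answered.
    static int run(BufferedReader in, PrintStream out, Function<String, List<String>> searcher) {
        int answered = 0;
        out.println("Enter a query (or 'exit' to quit):");
        try {
            String line;
            while ((line = in.readLine()) != null) {
                String query = line.trim();
                if (query.equalsIgnoreCase("exit")) {
                    break;
                }
                if (query.isEmpty()) {
                    // Reject blank input instead of running an empty search.
                    out.println("Please enter at least one search term.");
                    continue;
                }
                List<String> hits = searcher.apply(query);
                if (hits.isEmpty()) {
                    out.println("No articles found for: " + query);
                } else {
                    for (int i = 0; i < hits.size(); i++) {
                        out.println((i + 1) + ". " + hits.get(i));
                    }
                }
                answered++;
            }
        } catch (IOException e) {
            // Surface read failures without forcing callers to handle a checked exception.
            throw new UncheckedIOException(e);
        }
        return answered;
    }
}
```

A `main` method could call `run` with a `BufferedReader` wrapped around `System.in`, `System.out`, and whatever search function the engine exposes.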

## Dependencies

- Java Development Kit (JDK)
- Apache Lucene
- Stanford NLP
- JUnit (for testing)

## Contributors

- [Azan Siddiq](https://github.com/azaans23)
- [Taha Mukhtiar Janjua](https://github.com/tahamukhtiar)
- [Abdullah Aleem Gujjar](https://github.com/abdullahaleemgujjar)

## License

This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

## Acknowledgments

- Special thanks to [NoOne.org](https://noone.org) for their support and inspiration.

---
Feel free to contribute with additional information.