https://github.com/eXascaleInfolab/axel

Project for exploratory search on scientific articles

axel
====

A planned project for searching scientific articles using predefined concepts.

Brief details
=============

* Collect research articles from different conferences (ACM proceedings); how to collect them is still undecided
* Convert all articles to text, stem/lemmatize them, and build an inverted index
* Load a dictionary of Computer Science concepts into the system
* Identify all possible concepts in the article texts and store only their TF (term frequency) data
* Create an interface to search for articles by the concepts you are interested in, ranked by TF-IDF
* PROFIT!
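
The steps above can be sketched in plain Python. This is only an illustration under stated assumptions, not the project's implementation: the sample articles, the concept list, and the function names (`normalize`, `concept_tf`, `build_index`, `search`) are all hypothetical, stemming/lemmatization is replaced by simple lower-casing, and the IDF variant used is `log(N / df)`.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical toy data: article id -> text already extracted from the paper.
ARTICLES = {
    "a1": "Inverted index structures speed up information retrieval.",
    "a2": "Machine learning improves ranking in information retrieval.",
    "a3": "Machine learning models need a lot of training data.",
}

# Hypothetical concept dictionary (lower-cased phrases).
CONCEPTS = ["information retrieval", "machine learning", "inverted index"]


def normalize(text):
    """Lower-case and tokenize; a real pipeline would also stem or lemmatize."""
    return " ".join(re.findall(r"[a-z0-9]+", text.lower()))


def concept_tf(text, concepts):
    """Count how often each concept phrase occurs in the normalized text."""
    normalized = normalize(text)
    counts = Counter()
    for concept in concepts:
        pattern = r"\b" + re.escape(concept) + r"\b"
        n = len(re.findall(pattern, normalized))
        if n:
            counts[concept] = n
    return counts


def build_index(articles, concepts):
    """Inverted index: concept -> {article id: term frequency}."""
    index = defaultdict(dict)
    for article_id, text in articles.items():
        for concept, tf in concept_tf(text, concepts).items():
            index[concept][article_id] = tf
    return index


def search(index, n_articles, query_concepts):
    """Rank articles by TF-IDF summed over the selected concepts."""
    scores = defaultdict(float)
    for concept in query_concepts:
        postings = index.get(concept, {})
        if not postings:
            continue
        idf = math.log(n_articles / len(postings))
        for article_id, tf in postings.items():
            scores[article_id] += tf * idf
    # Highest score first; ties are broken by article id for stable output.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    index = build_index(ARTICLES, CONCEPTS)
    query = ["machine learning", "information retrieval"]
    for article_id, score in search(index, len(ARTICLES), query):
        print(article_id, round(score, 3))
```

Only TF values are stored per article; IDF is derived from the index at query time, so adding articles does not require rewriting existing entries.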

Implementation details
======================

* The application should initially consist of 2 pages: one for selecting concepts and another for showing the results.
* Concept selection page: the user picks concepts through autocomplete, and the chosen concepts are shown as labels. We can also automatically suggest other possibly relevant concepts, namely those that appear most often together with the selected ones.
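
One possible reading of the autocomplete and suggestion ideas is sketched below. Everything here is an assumption made for illustration: the sample data, the function names `autocomplete` and `suggest_concepts`, the prefix-matching rule, and the choice to count co-occurrence per article.

```python
from collections import Counter

# Hypothetical data: article id -> set of concepts found in that article.
ARTICLE_CONCEPTS = {
    "a1": {"information retrieval", "inverted index"},
    "a2": {"information retrieval", "machine learning", "ranking"},
    "a3": {"machine learning", "ranking"},
    "a4": {"information retrieval", "ranking"},
}


def autocomplete(prefix, concepts, limit=5):
    """Return concepts starting with the typed prefix, in alphabetical order."""
    prefix = prefix.lower()
    return sorted(c for c in concepts if c.startswith(prefix))[:limit]


def suggest_concepts(article_concepts, selected, limit=5):
    """Suggest concepts that co-occur most often with the selected ones."""
    selected = set(selected)
    counts = Counter()
    for concepts in article_concepts.values():
        # Only articles containing at least one selected concept contribute.
        if selected & concepts:
            counts.update(concepts - selected)
    # Most frequent first; ties are broken alphabetically for stable output.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [concept for concept, _ in ranked[:limit]]


if __name__ == "__main__":
    all_concepts = set().union(*ARTICLE_CONCEPTS.values())
    print(autocomplete("in", all_concepts))
    print(suggest_concepts(ARTICLE_CONCEPTS, ["information retrieval"]))
```

Counting each article once keeps long articles from dominating the suggestions; weighting by TF would be an alternative.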

Platform and frameworks details
===============================

* Django as a back-end framework
* Twitter Bootstrap as a front-end framework
* Django Haystack (Solr backend) for inverted index
* Celery for background tasks
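
As a rough sketch, wiring Haystack to Solr in a Django `settings.py` might look like the fragment below. The Solr URL is a placeholder assumption for a local instance, and the fragment covers only the Haystack part of the stack; the project's real settings are not described in this README.

```python
# Fragment of a hypothetical settings.py; only Haystack-related entries shown.
INSTALLED_APPS = [
    "django.contrib.contenttypes",
    "django.contrib.auth",
    "haystack",  # Django Haystack search integration
]

HAYSTACK_CONNECTIONS = {
    "default": {
        # Solr backend shipped with Django Haystack.
        "ENGINE": "haystack.backends.solr_backend.SolrEngine",
        # Assumed local Solr address; adjust to the actual deployment.
        "URL": "http://127.0.0.1:8983/solr",
    },
}
```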