https://github.com/jmrozanec/jindex
https://github.com/jmrozanec/jindex
Last synced: 5 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/jmrozanec/jindex
- Owner: jmrozanec
- License: apache-2.0
- Created: 2015-11-28T14:37:47.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2015-11-30T05:22:25.000Z (over 10 years ago)
- Last Synced: 2024-12-27T15:11:07.825Z (over 1 year ago)
- Language: Java
- Size: 5.1 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# JINDEX
This project provides means to extract surf Maven central repository and extract metadata from it.
Information about 75%+ of projects by available by mid 2015 is published in graphs folder, so that anyone can use it for analysis.
Graph was creating by considering the following algorithm:
- pick artifact from projectqueue
- if no artifact in queue, pick a random artifact
- check its latest version
- bring immediate dependencies
- for each dependency,
- register it as citation: origin, citedproject,1
- if dependencyId was not processed yet, enqueue it in projectqueue
## Measures of available dataset
- Total number of artifacts indexed (GAV): 1,111,966
- Total number of unique artifacts indexed (GA): 121,550
## Useful information
- [Maven central REST API](http://search.maven.org/ajaxsolr/images/MavenCentralAPIGuide.pdf)
- [Maven policies for good citizens](http://stackoverflow.com/questions/2439891/downloading-complete-maven-remote-repository-to-local-repository)