Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/davidhuser/CoolPeopleApp
Extracts persons from text files and exports them as a graph network file (GEXF for Gephi)
- Host: GitHub
- URL: https://github.com/davidhuser/CoolPeopleApp
- Owner: davidhuser
- Archived: true
- Created: 2014-05-31T16:09:34.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2014-08-11T17:22:10.000Z (over 10 years ago)
- Last Synced: 2024-08-02T14:08:17.413Z (3 months ago)
- Language: Java
- Homepage:
- Size: 19 MB
- Stars: 1
- Watchers: 4
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
CoolPeopleApp
=============

> This application generates a Gephi network graph file of the persons found in pre-specified English input texts.
> Two persons are in the same network if they are found in the same document.
> It reads resources as text files, reads them into a String, extracts names with [Named Entity Extraction](http://opennlp.sourceforge.net/models-1.5/) and a large list of first names,
> and generates a [GEXF file (Graph Exchange XML Format)](http://gexf.net/format/) from the names it finds.

## configuration
Edit `Config.java`:

* input text files - texts should be in English and plain text
* export path of graph file - keep the `.gexf` extension in the file path

## testing
`mvn test`

## building
Assemble a .jar with dependencies with `mvn clean compile assembly:single`

## extending
Since it is based on reading a String, more input data types can be added later on (web fetching, databases, ...)

## example
The Wikipedia pages of the Forbes top 10 richest persons [(Wikipedia: The world's billionaires)](http://en.wikipedia.org/wiki/The_World's_Billionaires#2014):
![Graph](http://i.imgur.com/Zpb5YUz.png)

More examples in the `examples` folder.
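
The core rule described in this README, that two persons are linked when they are found in the same document, can be sketched in a few lines of Java. This is a minimal illustration and not the project's actual code: the class `CooccurrenceGexfSketch`, its methods, and the sample names are all hypothetical, name extraction is assumed to have happened already, and the XML is written by hand following the GEXF 1.2 draft layout.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

/** Hypothetical sketch: names per document -> co-occurrence graph -> GEXF text. */
public class CooccurrenceGexfSketch {

    /** Escapes characters that are not allowed inside an XML attribute value. */
    static String escapeXml(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;")
                .replace(">", "&gt;").replace("\"", "&quot;");
    }

    /**
     * Builds a GEXF document from the person names found in each input text.
     * Every pair of names sharing a document becomes one undirected edge.
     */
    static String toGexf(List<Set<String>> namesPerDocument) {
        Map<String, Integer> nodeIds = new LinkedHashMap<>();
        Set<String> edges = new LinkedHashSet<>(); // "source target" keys, de-duplicated

        for (Set<String> names : namesPerDocument) {
            // Sort the names so node ids and edge order are deterministic.
            List<String> sorted = new ArrayList<>(new TreeSet<>(names));
            for (String name : sorted) {
                nodeIds.putIfAbsent(name, nodeIds.size());
            }
            for (int i = 0; i < sorted.size(); i++) {
                for (int j = i + 1; j < sorted.size(); j++) {
                    int a = nodeIds.get(sorted.get(i));
                    int b = nodeIds.get(sorted.get(j));
                    edges.add(Math.min(a, b) + " " + Math.max(a, b));
                }
            }
        }

        StringBuilder xml = new StringBuilder();
        xml.append("<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n");
        xml.append("<gexf xmlns=\"http://www.gexf.net/1.2draft\" version=\"1.2\">\n");
        xml.append("  <graph mode=\"static\" defaultedgetype=\"undirected\">\n");
        xml.append("    <nodes>\n");
        for (Map.Entry<String, Integer> node : nodeIds.entrySet()) {
            xml.append("      <node id=\"").append(node.getValue())
               .append("\" label=\"").append(escapeXml(node.getKey())).append("\"/>\n");
        }
        xml.append("    </nodes>\n");
        xml.append("    <edges>\n");
        int edgeId = 0;
        for (String edge : edges) {
            String[] ends = edge.split(" ");
            xml.append("      <edge id=\"").append(edgeId++)
               .append("\" source=\"").append(ends[0])
               .append("\" target=\"").append(ends[1]).append("\"/>\n");
        }
        xml.append("    </edges>\n");
        xml.append("  </graph>\n");
        xml.append("</gexf>\n");
        return xml.toString();
    }

    public static void main(String[] args) {
        // Made-up sample data: the names found in two input documents.
        List<Set<String>> docs = List.of(
                Set.of("Alice", "Bob"),
                Set.of("Bob", "Carol"));
        System.out.print(toGexf(docs));
    }
}
```

With the sample data, Alice and Carol each connect to Bob but not to each other, because they never share a document. The resulting text can be saved with a `.gexf` extension and opened in Gephi.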