useful-java-links
A list of useful Java frameworks, libraries, software and hello world examples
https://github.com/Vedenin/useful-java-links
-
I. Development
-
9. Distributed Applications
- JADE - JADE (Java Agent DEvelopment Framework) is a software Framework fully implemented in the Java language. It simplifies the implementation of multi-agent systems through a middle-ware that complies with the FIPA specifications and through a set of graphical tools that support the debugging and deployment phases. License: [GNU Lesser 2.0](https://en.wikipedia.org/wiki/GNU_Lesser_General_Public_License). 
- Apache River - compliant JINI service. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Apache Tuscany - 2.0). 
- Apache Storm - Apache Storm is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing. Storm is simple, can be used with any programming language, and is a lot of fun to use! [](https://github.com/apache/storm). License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Twitter Heron - Heron is a realtime analytics platform developed by Twitter. It is the direct successor of Apache Storm, built to be backwards compatible with Storm's topology API but with a wide array of architectural improvements. [](https://github.com/twitter/heron). http://heronstreaming.io. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Netflix Curator - 2.0) , [](https://github.com/Netflix/curator). 
- Netflix Hystrix - Hystrix is a latency and fault tolerance library designed to isolate points of access to remote systems, services and 3rd party libraries, stop cascading failure and enable resilience in complex distributed systems where failure is inevitable, [](https://github.com/Netflix/Hystrix). License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- JGroups - JGroups is a toolkit for reliable messaging. It can be used to create clusters whose nodes can send messages to each other. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Orbit - Orbit is a modern framework for JVM languages that makes it easier to build and maintain distributed and scalable online services. It was developed by BioWare, a division of Electronic Arts. License: [BSD 3](https://en.wikipedia.org/wiki/BSD_licenses#3-clause_license_.28.22Revised_BSD_License.22.2C_.22New_BSD_License.22.2C_or_.22Modified_BSD_License.22.29). 
- Storm contrib - v10.html), [](https://github.com/nathanmarz/storm-contrib). 
- Lagom - Reactive Microservices for the JVM. https://www.lightbend.com/lagom . License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0), [](https://github.com/lagom/lagom). 
- Alibaba Dubbo - 2.0) , [](https://github.com/alibaba/dubbo). 
- Axon Framework - The Axon Framework focuses on making life easier for developers who want to create a Java application based on CQRS principles. The framework is used in a wide range of environments. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
-
-
II. Databases, search engines, big data and machine learning
-
1. Databases and storages
- H2 - H2 is a Java SQL database. Its main features are: very fast, open source, JDBC API; embedded and server modes; in-memory databases; browser-based console application; small footprint (around 1.5 MB jar file size). License: [Mozilla Public License 1.1](https://en.wikipedia.org/wiki/Mozilla_Public_License) and [Eclipse Public License v1.0](https://en.wikipedia.org/wiki/Eclipse_Public_License). 
- Apache Derby - Apache Derby, an Apache DB subproject, is an open source relational database implemented entirely in Java. Derby provides an embedded JDBC driver that lets you embed Derby in any Java-based solution. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Apache Empire-db - db is a lightweight relational database abstraction layer and data persistence component. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Thinkaurelius Titan - Titan is a highly scalable graph database optimized for storing and querying large graphs with billions of vertices and edges distributed across a multi-machine cluster. Titan is a transactional database that can support thousands of concurrent users, complex traversals, and analytic graph queries. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/thinkaurelius/titan). 
- Apache Cassandra - Cassandra is a partitioned row store. Rows are organized into tables with a required primary key. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/apache/cassandra). 
- Orientdb - Model DBMS with Document & Graph engine. OrientDB can run distributed (Multi-Master), supports SQL, ACID Transactions, Full-Text indexing, Reactive Queries and has a small memory footprint. OrientDB is licensed with Apache 2 license and the development is driven by OrientDB LTD and a worldwide Open Source community. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0)/CDDL 1/Eclipse Distribution 1.0, [](https://github.com/orientechnologies/orientdb). 
- Neo4j - 3.0.en.html), [](https://github.com/neo4j/neo4j). 
- Mapdb - heap-memory. It is a fast and easy to use embedded Java database engine. http://www.mapdb.org/. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/jankotek/mapdb). 
- Voldemort - value storage system. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/voldemort/voldemort). 
- Alluxio (formerly Tachyon) - Speed Virtual Distributed Storage System. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/Alluxio/alluxio). 
- Opentsdb - java-links/blob/master/img/impossible-proprietary-code-linking.png?raw=true)
- Hazelcast - Memory Data Grid. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/hazelcast/hazelcast). 
- Tinkerpop Blueprints - enabled applications. Likewise, Blueprints-enabled applications can plug-and-play different Blueprints-enabled graph backends. License: [BSD 3](https://en.wikipedia.org/wiki/BSD_licenses#3-clause_license_.28.22Revised_BSD_License.22.2C_.22New_BSD_License.22.2C_or_.22Modified_BSD_License.22.29), [](https://github.com/tinkerpop/blueprints). 
- Apache Lucene solr - 2.0) , [](https://github.com/apache/lucene-solr). 
- Java Chronicle - 2.0) , [](https://github.com/peter-lawrey/Java-Chronicle). 
- Torodb - Open source NoSQL database that runs on top of a RDBMS. Compatible with MongoDB protocol and APIs, but with support for native SQL, atomic operations and reliable and durable backends like PostgreSQL. License: [GNU AGPLv3](http://www.gnu.org/licenses/agpl-3.0.en.html), [](https://github.com/torodb/torodb). 
- Crate - 2.0) , [](https://github.com/crate/crate). 
- Solandra - time distributed search engine built on Apache Solr and Apache Cassandra. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/tjake/Solandra). 
- Voltdb - scalable, in-memory SQL RDBMS designed for applications that have extremely high read and write throughput requirements. License: [GNU AGPLv3](http://www.gnu.org/licenses/agpl-3.0.en.html), [](https://github.com/VoltDB/voltdb). 
- Leveldb - for-byte exact copies of the C++ code. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/dain/leveldb). 
- Kairosdb - 2.0) , [](https://github.com/kairosdb/kairosdb). 
- Elephantdb - clause_license_.28.22Revised_BSD_License.22.2C_.22New_BSD_License.22.2C_or_.22Modified_BSD_License.22.29), [](https://github.com/nathanmarz/elephantdb). 
- Apache Drill - 2.0) , [](https://github.com/apache/drill). 
- Tinkerpop Rexster - clause_license_.28.22Revised_BSD_License.22.2C_.22New_BSD_License.22.2C_or_.22Modified_BSD_License.22.29), [](https://github.com/tinkerpop/rexster). 
- Tomcat redis session manager - backed non-sticky session store for Apache Tomcat. License: [MIT](https://opensource.org/licenses/MIT) , [](https://github.com/jcoleman/tomcat-redis-session-manager). 
- Embulk - 2.0) , [](https://github.com/embulk/embulk). 
- Tarantool - source NoSQL database management system and Lua application server. It maintains databases in memory and ensures crash resistance with write-ahead logging. It includes a Lua interpreter and interactive console but also accepts connections from programs in several other languages. License: [BSD licenses](https://en.wikipedia.org/wiki/BSD_licenses). 
- Druid - Druid is a fast column-oriented distributed data store. http://druid.io. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0), [](https://github.com/druid-io/druid). 
- Linkedin Sensei - 2.0) , [](https://github.com/linkedin/sensei). 
-
2. Data structures
- Apache Avro - Apache Avro is a data serialization system. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Apache Orc - The smallest, fastest columnar storage for Hadoop workloads. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Apache Thrift - The Apache Thrift software framework, for scalable cross-language services development, combines a software stack with a code generation engine to build services that work efficiently and seamlessly between C++, Java, Python, PHP, Ruby, Erlang, Perl, Haskell, C#, Cocoa, JavaScript, Node.js, Smalltalk, OCaml and Delphi and other languages. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Persistent Collection - PCollections serves as a persistent and immutable analogue of the Java Collections Framework. License: [MIT](https://opensource.org/licenses/MIT). 
- Protobuf - Protocol Buffers (a.k.a., protobuf) are Google's language-neutral, platform-neutral, extensible mechanism for serializing structured data. License: [BSD 3](https://en.wikipedia.org/wiki/BSD_licenses). 
- Wire - Clean, lightweight protocol buffers for Android and Java. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
-
3. Search engines
- Lemur Project - java-links/blob/master/img/business-friendly.png?raw=true)
- Elasticsearch - Elasticsearch is a distributed RESTful search engine built for the cloud. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/elastic/elasticsearch). 
- Linkedin Indextank Engine - 2.0) , [](https://github.com/linkedin/indextank-engine). 
- Linkedin Cleo - of-order and real-time typeahead search library. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/linkedin/cleo). 
- Elasticsearch cloud aws - 2.0) , [](https://github.com/elastic/elasticsearch-cloud-aws). 
- Elasticsearch analysis ik - https://github.com/medcl/elasticsearch-analysis-ik
- YaCy - The YaCy Search Engine application, http://yacy.net/ [](https://github.com/yacy/yacy_search_server). License: [GNU Lesser](https://en.wikipedia.org/wiki/GNU_Lesser_General_Public_License) and [GNU](https://en.wikipedia.org/wiki/GNU_General_Public_License)
-
4. Client and drivers for databases
- Facebook Presto - 2.0) , [](https://github.com/prestodb/presto). 
- Tinkerpop Gremlin - clause_license_.28.22Revised_BSD_License.22.2C_.22New_BSD_License.22.2C_or_.22Modified_BSD_License.22.29) , [](https://github.com/tinkerpop/gremlin). 
- Mongodb Java Driver - 2.0) / Creative Commons Attribution , [](https://github.com/mongodb/mongo-java-driver). 
- Yahoo Mysql_perf_analyzer - 2.0) , [](https://github.com/yahoo/mysql_perf_analyzer). 
- Airbnb Airpal - 2.0) , [](https://github.com/airbnb/airpal). 
- Mongodb hadoop - https://github.com/mongodb/mongo-hadoop
- Elasticsearch jdbc - 2.0) , [](https://github.com/jprante/elasticsearch-jdbc). 
- Flyway - 2.0) , [](https://github.com/flyway/flyway). 
- Elasticsearch river mongodb - MongoDB River Plugin for ElasticSearch. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/richardwilly98/elasticsearch-river-mongodb). 
- Netflix Astyanax - 2.0) , [](https://github.com/Netflix/astyanax). 
- Liquibase - 2.0) , [](https://github.com/liquibase/liquibase). 
- Couchdb lucene - text searching of CouchDB documents using Lucene. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/rnewson/couchdb-lucene). 
- Forcedotcom Phoenix - embedded JDBC driver, powering the HBase use cases at Salesforce.com. Phoenix targets low-latency queries (milliseconds), as opposed to batch operation via map/reduce. License: [BSD 3](https://en.wikipedia.org/wiki/BSD_licenses#3-clause_license_.28.22Revised_BSD_License.22.2C_.22New_BSD_License.22.2C_or_.22Modified_BSD_License.22.29) , [](https://github.com/forcedotcom/phoenix). 
- Variety - java-links/blob/master/img/business-friendly.png?raw=true)
- Netflix Priam - Process for backup/recovery, Token Management, and Centralized Configuration management for Cassandra. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/Netflix/Priam). 
- Memcached Java Client - clause_license_.28original_.22BSD_License.22.29), [](https://github.com/gwhalin/Memcached-Java-Client). 
- Spring data mongodb - 2.0) , [](https://github.com/spring-projects/spring-data-mongodb). 
- FlexyPool - FlexyPool adds metrics and failover strategies to a given Connection Pool, allowing it to resize on demand. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Jaybird JCA/JDBC driver - JCA-JDBC Driver for Firebird database. http://www.firebirdsql.org/en/jdbc-driver/, [](https://github.com/FirebirdSQL/jaybird). License: [GNU Lesser 2.1](https://en.wikipedia.org/wiki/GNU_Lesser_General_Public_License). 
- dbeaver - Free universal database manager and SQL client. http://dbeaver.jkiss.org, [](https://github.com/serge-rider/dbeaver). License: [GNU 2](https://en.wikipedia.org/wiki/GNU_General_Public_License). 
- tarantool-java - java-links/blob/master/img/business-friendly.png?raw=true)
-
5. ORM
- Vibur DBCP - Concurrent, fast, and fully-featured JDBC connection pool, which provides a non-starvation guarantee for application threads, statement caching, slow SQL queries detection and logging, and Hibernate integration, among other features. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- OrmLite - Object Relational Mapping Lite (ORM Lite) provides some simple, lightweight functionality for persisting Java objects to SQL databases while avoiding the complexity and overhead of more standard ORM packages. License: [ISC](https://en.wikipedia.org/wiki/%D0%9B%D0%B8%D1%86%D0%B5%D0%BD%D0%B7%D0%B8%D1%8F_ISC). 
- Apache OpenJPA - 2.0). 
- HikariCP - performance JDBC connection pool at last. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/brettwooldridge/HikariCP). 
- Mybatis 3 - 2.0) , [](https://github.com/mybatis/mybatis-3). 
- Hibernate orm - https://github.com/hibernate/hibernate-orm
- JOOQ - 2.0) , [](https://github.com/jOOQ/jOOQ). 
- Querydsl - safe SQL-like queries for multiple backends including JPA, MongoDB and SQL in Java. http://www.querydsl.com/ . License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/querydsl/querydsl). 
- Kundera - Datastore Mapping Library for NoSQL Datastores. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/impetus-opensource/Kundera). 
- Spring data jpa - based data access layer. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/spring-projects/spring-data-jpa). 
- Jdbi - 2.0) , [](https://github.com/jdbi/jdbi). 
-
6. Working with messy data
- Apache UIMA - 2.0). 
- Storm - Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/nathanmarz/storm). 
- OpenRefine - java-links/blob/master/img/business-friendly.png?raw=true)
- Addthis Stream lib - Stream summarizer and cardinality estimator. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/addthis/stream-lib). 
- HdrHistogram (A High Dynamic Range (HDR) Histogram ) - clause_license_.28.22Simplified_BSD_License.22_or_.22FreeBSD_License.22.29) , [](https://github.com/HdrHistogram/HdrHistogram). 
- HazyResearch DeepDive - 2.0) , [](https://github.com/HazyResearch/deepdive). 
- Seldon Server - 2.0) , [](https://github.com/SeldonIO/seldon-server). 
- Pulsar - Realtime analytics, this includes the core components of Pulsar pipeline. License: [GNU 2.0](https://en.wikipedia.org/wiki/GNU_General_Public_License), [](https://github.com/pulsarIO/realtime-analytics). 
- Suro: Netflix's Data Pipeline - 2.0) , [](https://github.com/Netflix/suro). 
-
7. Big data
- Apache Hama - purpose BSP computing engine. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Apache Falcon - 2.0). 
- Apache REEF - out computing fabric that eases the development of Big Data applications on top of resource managers such as Apache YARN and Mesos. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Apache Accumulo - based access control and a server-side programming mechanism that can modify key/value pairs at various points in the data management process. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Apache Gora - memory data model and persistence for big data. Gora supports persisting to column stores, key value stores, document stores and RDBMSs, and analyzing the data with extensive Apache Hadoop MapReduce support. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Giraph - java-links/blob/master/link-rus/img/good-proprietary.png?raw=true)
- Apache Ambari - to-use Hadoop management web UI backed by its RESTful APIs. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Apache Chukwa - 2.0). 
- Apache Bigtop - 2.0). 
- Apache BookKeeper - 2.0). 
- Apache Crunch - 2.0). 
- Apache MRUnit - 2.0). 
- Apache MetaModel - 2.0). 
- H2o - 2.0) , [](https://github.com/h2oai/h2o-2). 
- Twitter Elephant bird - Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/twitter/elephant-bird). 
- Apache Hadoop - The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/apache/hadoop). 
- Google Mr4c - java-links/blob/master/img/business-friendly.png?raw=true)
- Alibaba Jstorm - tolerant realtime computation system. Inspired by Apache Storm, JStorm has been completely rewritten in Java and provides many more enhanced features. JStorm has been widely used in many enterprise environments and proved robust and stable. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/alibaba/jstorm). 
- Etsy Oculus - java-links/blob/master/img/business-friendly.png?raw=true)
- Linkedin Datafu - scale data in Hadoop. The project was inspired by the need for stable, well-tested libraries for data mining and statistics. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/linkedin/datafu). 
- Apache Flink - Apache Flink is an open source stream processing framework with powerful stream- and batch-processing capabilities. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/apache/flink). 
- Oryx 2 - Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/OryxProject/oryx). 
- Yahoo SAMOA (Scalable Advanced Massive Online Analysis) - source platform for mining big data streams. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/yahoo/samoa). 
- Apache Hive - The Apache Hive data warehouse software facilitates querying and
- Apache Kylin - dimensional analysis (OLAP) on Hadoop supporting extremely large datasets. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/KylinOLAP/Kylin). 
- Elasticsearch hadoop - Elasticsearch real-time search and analytics natively integrated with Hadoop. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/elastic/elasticsearch-hadoop). 
- Impala - Real-time Query for Hadoop. [](https://github.com/cloudera/impala). License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Umongo - Desktop app to browse and administer your MongoDB cluster. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/agirbal/umongo). 
- Facebook Presto - 2.0) , [](https://github.com/facebook/presto). 
- Cloudera Oryx - time large-scale machine learning / predictive analytics infrastructure. It implements a few classes of algorithm commonly used in business applications: collaborative filtering / recommendation, classification / regression, and clustering. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/cloudera/oryx). 
- Linkedin Gobblin - 2.0) , [](https://github.com/linkedin/gobblin). 
- Druid - 2.0) , [](https://github.com/druid-io/druid). 
-
8. Machine Learning
- Apache Marmotta - 2.0). 
- DeepDive - DeepDive is a system to extract value from dark data. Like dark matter, dark data is the great mass of data buried in text, tables, figures, and images, which lacks structure and so is essentially unprocessable by existing software. License: [Creative Commons Attribution 4.0](http://creativecommons.org/licenses/by/4.0/). 
- JAVA-ML - Java Machine Learning Library (Java-ML). License: [GNU 2](https://en.wikipedia.org/wiki/GNU_General_Public_License). 
- JSAT - Numerous Machine Learning algorithms for classification, regression, and clustering. License: [GNU 3](https://en.wikipedia.org/wiki/GNU_General_Public_License). 
- Meka - An open source implementation of methods for multi-label classification and evaluation (extension to Weka). License: [GNU](https://en.wikipedia.org/wiki/GNU_General_Public_License). 
- Neuroph - Neuroph is a lightweight Java neural network framework. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- wAlnut - Object oriented model of partial human brain with 1 theorized common learning algorithm. Work in progress towards a strong emotional AI. License: [GNU 3](https://en.wikipedia.org/wiki/GNU_General_Public_License) 
- Sat4J - Sat4j is a Java library for solving Boolean satisfaction and optimization problems. It can solve SAT, MAXSAT, Pseudo-Boolean, and Minimally Unsatisfiable Subset (MUS) problems. License: [GNU Lesser 3](https://en.wikipedia.org/wiki/GNU_Lesser_General_Public_License) and [Eclipse Public 1.0](https://www.eclipse.org/legal/epl-v10.html). 
- Apache OpenNLP - The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. [Hello World examples](https://github.com/Vedenin/useful-java-links/tree/master/helloworlds/2.8-natural-language-processing/opennlp). License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Mallet - MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text. License: [Eclipse Public License v1.0](https://en.wikipedia.org/wiki/Eclipse_Public_License). 
- Twitter Text Libraries - This repo is a collection of libraries and conformance tests to standardize parsing of tweet text. It synchronizes development, testing, creating issues, and pull requests for twitter-text's implementations and specification. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Joshua - source statistical machine translation decoder for phrase-based (new in 6.0), hierarchical, and syntax-based machine translation, written in Java. It is developed at the Human Language Technology Center of Excellence at Johns Hopkins University. License: [BSD 2](https://github.com/joshua-decoder/joshua/blob/master/LICENSE). 
- Z-MERT - MERT is a software tool for minimum error rate training of machine translation systems. License: [GNU Lesser General Public License (LGPL)](http://www.gnu.org/licenses/lgpl-3.0.en.html). 
- The Stanford Natural Language Processing Group - Natural language processing software tools in Java. Subprojects: [Stanford Parser](http://nlp.stanford.edu/software/lex-parser.html) (a statistical parser), [Stanford POS Tagger](http://nlp.stanford.edu/software/tagger.html), [Stanford Named Entity Recognizer](http://nlp.stanford.edu/software/CRF-NER.html), [RegexNER](http://nlp.stanford.edu/software/regexner.html), Word Segmenter, [Pattern-based Information Extraction and Diagnostics](http://nlp.stanford.edu/software/patternslearning.html), [Stanford Relation Extractor](http://nlp.stanford.edu/software/relationExtractor.html). License: [GNU 2/3](https://en.wikipedia.org/wiki/GNU_General_Public_License). 
- Weka - Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes. License: [GNU](https://en.wikipedia.org/wiki/GNU_General_Public_License). 
- Airbnb Aerosolve - 2.0) , [](https://github.com/airbnb/aerosolve). 
- Smile - of-art machine learning algorithms. Smile is self contained and requires only Java standard library. [Hello World examples](https://github.com/Vedenin/useful-java-links/tree/master/helloworlds/2.8-machine-learning/smile). License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/haifengl/smile). 
- Deeplearning4j - licensed, open-source, distributed neural net library written in Java and Scala. http://deeplearning4j.org/. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/deeplearning4j/deeplearning4j). 
- Libsvm - to-use, and efficient software for SVM classification and regression. It solves C-SVM classification, nu-SVM classification, one-class-SVM, epsilon-SVM regression, and nu-SVM regression. License: [BSD 3](https://en.wikipedia.org/wiki/BSD_licenses#3-clause_license_.28.22Revised_BSD_License.22.2C_.22New_BSD_License.22.2C_or_.22Modified_BSD_License.22.29), [](https://github.com/cjlin1/libsvm). 
- Neuralnetworks - https://github.com/ivan-vasilev/neuralnetworks
- Datumbox framework - source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/datumbox/datumbox-framework). 
- Apache Mahout - 2.0) , [](https://github.com/apache/mahout). 
- JSAT - Java Statistical Analysis Tool, a Java library for Machine Learning. License: [GNU 3](https://en.wikipedia.org/wiki/GNU_General_Public_License). 
- htm.java - Hierarchical Temporal Memory implementation in Java - an official Community-Driven Java port of the Numenta Platform for Intelligent Computing (NuPIC). License: [GNU AFFERO 3](https://en.wikipedia.org/wiki/Affero_General_Public_License). 
- rapaio - statistics, data mining and machine learning toolbox in Java. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Stanfordnlp CoreNLP - java-links/tree/master/helloworlds/2.8-natural-language-processing/stanford-core-nlp). License: [GNU 2](https://en.wikipedia.org/wiki/GNU_General_Public_License), [](https://github.com/stanfordnlp/CoreNLP). 
- LingPipe - LingPipe is a toolkit for processing text using computational linguistics. License: [AGPL](https://www.gnu.org/licenses/agpl-3.0.html) or proprietary. 
- NLP4J - 2.0). 
- ClearTK - ClearTK is a framework for developing machine learning and natural language processing components within the Apache Unstructured Information Management Architecture, http://cleartk.github.io/cleartk/ . [](https://github.com/ClearTK/cleartk) License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- ARK Twitter NLP - CMU ARK Twitter Part-of-Speech Tagger. http://www.ark.cs.cmu.edu/TweetNLP/ License: [GNU 2/3](https://en.wikipedia.org/wiki/GNU_General_Public_License), [](https://github.com/brendano/ark-tweet-nlp). 
- lc4j - Language Categorization for Java: an Open Source library for finding out in which language a text has been written. License: [MIT](https://en.wikipedia.org/wiki/MIT_License). 
- CMU Sphinx - Open Source Speech Recognition Toolkit. License: [BSD 2](https://en.wikipedia.org/wiki/BSD_licenses#2-clause_license_.28.22Simplified_BSD_License.22_or_.22FreeBSD_License.22.29). 
- Apache SystemML - flexible, scalable machine learning (ML) language. (incubator project). License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Protégé - A free, open-source ontology editor and framework for building intelligent systems. License: [BSD 2](https://en.wikipedia.org/wiki/BSD_licenses#2-clause_license_.28.22Simplified_BSD_License.22_or_.22FreeBSD_License.22.29). 
- JAVA-ML - Java Machine Learning Library (Java-ML). License: [GNU 2](https://en.wikipedia.org/wiki/GNU_General_Public_License). 
- The Stanford Natural Language Processing Group - Natural language processing software tools for Java. Subprojects: [Stanford Parser](http://nlp.stanford.edu/software/lex-parser.html) (a statistical parser), [Stanford POS Tagger](http://nlp.stanford.edu/software/tagger.html), [Stanford Named Entity Recognizer](http://nlp.stanford.edu/software/CRF-NER.html), [RegexNER](http://nlp.stanford.edu/software/regexner.html), Word Segmenter, [Stanford Pattern-based Information Extraction and Diagnostics](http://nlp.stanford.edu/software/patternslearning.html), [Stanford Relation Extractor](http://nlp.stanford.edu/software/relationExtractor.html). License: [GNU 2/3](https://en.wikipedia.org/wiki/GNU_General_Public_License). 
- Apache Tika - License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Choco - Choco is free and open-source software dedicated to constraint programming. It aims at describing hard combinatorial problems in the form of constraint satisfaction problems and solving them with constraint programming techniques. License: [BSD](https://en.wikipedia.org/wiki/BSD_licenses#3-clause_license_.28.22Revised_BSD_License.22.2C_.22New_BSD_License.22.2C_or_.22Modified_BSD_License.22.29). 
- JaCoP - Java Constraint Programming (JaCoP) solver. **License: ?**
- OptaPlanner - OptaPlanner is a constraint satisfaction solver. It optimizes business resource planning. Every organization faces scheduling puzzles: assign a limited set of constrained resources (employees, assets, time and money) to provide products or services to customers. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
-
-
III. Network and Integration
-
10. Integration with API
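- Alibaba RocketMQ - License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0), [](https://github.com/alibaba/RocketMQ). 
- Twitter Zipkin - License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0), [](https://github.com/openzipkin/zipkin). 
- Wizcorp Phonegap facebook plugin - License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0), [](https://github.com/Wizcorp/phonegap-facebook-plugin). 
- Twitter Hbc - License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0), [](https://github.com/twitter/hbc). 
- Spring social - License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0), [](https://github.com/spring-projects/spring-social). 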
- Twitter4j - An open-sourced, mavenized and Google App Engine safe Java library for the Twitter API. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0), [](https://github.com/yusuke/twitter4j). 
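- Twitter Commons - License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0), [](https://github.com/twitter/commons). 
- Netflix Ice - License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0), [](https://github.com/Netflix/ice). 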
-
11. Bitcoin
- XChange
- bitcoinj - License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0), [](https://github.com/bitcoinj/bitcoinj). 
- DiabloMiner - [](https://github.com/Diablo-D3/DiabloMiner). 
-
12. Clouds
- Apache CloudStack - License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Apache Stratos - An extensible Platform-as-a-Service (PaaS) framework that helps run Apache Tomcat, PHP, and MySQL applications and can be extended to support many more environments on all major cloud infrastructures. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
- Netflix SimianArmy - Tools for keeping your cloud operating in top form. Chaos Monkey is a resiliency tool that helps applications tolerate random instance failures. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/Netflix/SimianArmy). 
- Netflix Eureka - AWS service registry for resilient mid-tier load balancing and failover. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0), [](https://github.com/Netflix/eureka). 
- Aws sdk java - License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0), [](https://github.com/aws/aws-sdk-java). 
- Syncany
- Legacy Jclouds - A library offering portable abstractions or cloud-specific features. It supports many clouds including Amazon, VMware, Azure, and Rackspace. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0), [](https://github.com/jclouds/legacy-jclouds). 
- Quarkus - License: [Apache 2.0](https://github.com/quarkusio/quarkus/blob/master/LICENSE.txt). 
- Apache Airavata - A software framework used to build web-based science gateways and to help compose, manage, execute, and monitor large-scale applications (wrapped as web-based services) and workflows composed of these services. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0). 
-