[![Build Status](https://travis-ci.org/lordofthejars/nosql-unit.svg?branch=master)](https://travis-ci.org/lordofthejars/nosql-unit)

Documentation
=============

NoSQLUnit Core
==============

Overview
--------

Unit testing is a method by which the smallest testable part of an
application is validated. Unit tests should follow the FIRST rules:
Fast, Isolated, Repeatable, Self-Validated and Timely.

It is hard to imagine a JEE application without a persistence layer
(whether a typical relational database or one of the newer *NoSQL*
databases), so it is worth writing unit tests for the persistence layer
too. When we write unit tests for the persistence layer we should take
care not to break two key FIRST properties: fast and isolated.

Our tests will be *fast* if they don't access the network or the
filesystem, and persistence systems rely heavily on both. For RDBMS (
*SQL* ) there are many Java in-memory databases such as Apache Derby, H2
or HSQLDB. As their name suggests, these databases are embedded into
your program and keep data in memory, so your tests stay fast. The
problem is with *NoSQL* systems, because of their heterogeneity: some
follow a document approach (like MongoDb), others a column approach
(like HBase), others a graph approach (like Neo4J). For this reason an
in-memory mode has to be provided by each vendor; there is no generic
solution.

Our tests must also be isolated from one another. It is not acceptable
for one test method to modify the result of another, yet with
persistence tests this happens whenever one test inserts an entry into
the database and the next test execution finds that change. So before
each test runs, the database should be in a known state. Note that if
your tests find the database in a known state they are also repeatable;
if a test assertion depends on a previous test execution, every run
becomes unique. For homogeneous systems like RDBMS, *DBUnit* exists to
keep the database in a known state before each execution, but there is
no *DBUnit*-like framework for the heterogeneous *NoSQL* world.

*NoSQLUnit* solves this problem by providing a *JUnit* extension which
helps us manage the lifecycle of NoSQL systems and takes care of
bringing databases into a known state.

Requirements
------------

To run *NoSQLUnit*, *JUnit 4.10* or later is required. This is because
*NoSQLUnit* relies on *Rules*, whose behaviour changed in 4.10.

Although it should work with JDK 5, the jars are compiled with JDK 6.

NoSQLUnit
---------

*NoSQLUnit* is a *JUnit* extension that makes writing unit and
integration tests of systems with a NoSQL backend easier. It is composed
of two sets of *Rules* and a group of annotations.

The first set of *Rules* is responsible for managing the database
lifecycle; there are two for each supported backend.

- The first one (where possible) is the *in-memory* mode. This mode
takes care of starting and stopping the database system in
*in-memory* mode, and will typically be used during unit test
execution.

- The second one is the *managed* mode. This mode is in charge of
starting the *NoSQL* server as a remote process (on the local machine)
and stopping it. It will typically be used during integration test
execution.

You can add them in test suites and/or test classes; *NoSQLUnit* takes care of starting each database only once.

The second set of *Rules* is responsible for keeping the database in a
known state. Each supported backend has its own, and it can be
understood as a connection to the configured database which executes
the operations required to keep the system stable.

Note that because *NoSQL* databases are heterogeneous, each system
requires its own implementation.

And finally two annotations are provided,
[@UsingDataSet](#seeding_database) and
[@ShouldMatchDataSet](#verifying_database) (thank you so much,
*Arquillian* people, for the names).

### Seeding Database

@UsingDataSet is used to seed the database with a defined data set. In
short, data sets are files that contain all the data to be inserted into
the configured database. To seed your database, use the *@UsingDataSet*
annotation either on the test method or at class level. If it is
defined on both, the test-level annotation takes precedence. The
annotation has two attributes: locations and loadStrategy.

With the locations attribute you can specify *classpath* dataset
locations, relative to the test class. Note that more than one dataset
can be specified.

The withSelectiveLocations attribute can also be used to specify dataset locations. See the Advanced Usage chapter for more information.

If no files are specified explicitly, the following
strategy is applied (an example follows the warning below):

- First, a file is searched on the classpath, in the same package as the
test class, named `[test class name]#[test method name].[format]`
(only if the annotation is present on the test method).

- If the first rule does not match, or the annotation is defined at
class scope, a file named `[test class name].[default format]` is
searched on the classpath, in the same package as the test class.

> **Warning**
>
> Datasets must reside on the *classpath*, and their format depends on
> the *NoSQL* vendor.
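
For example, with the MongoDB module (whose default format is json), the default lookup for a method-level annotation without locations would work as sketched below (class, method and file names are illustrative):

~~~~ {.java}
public class BookManagerTest {

    @Test
    @UsingDataSet(loadStrategy = LoadStrategyEnum.CLEAN_INSERT)
    public void book_should_be_found() {
        // NoSQLUnit first looks for BookManagerTest#book_should_be_found.json
        // on the classpath, in the same package as BookManagerTest; if it is
        // not there, it falls back to BookManagerTest.json.
    }
}
~~~~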

The second attribute, loadStrategy, selects the strategy used for
inserting data. Implemented strategies are:

--------------- ----------------------------------------------------------------------------------------------------------------------------------------
INSERT          Inserts the defined datasets before executing any test method.
DELETE\_ALL     Deletes all elements of the database before executing any test method.
CLEAN\_INSERT   This is the most used strategy. It deletes all elements of the database and then inserts the defined datasets before executing any test method.
--------------- ----------------------------------------------------------------------------------------------------------------------------------------

: Load Strategies

An example of usage:

~~~~ {.java}
@UsingDataSet(locations="my_data_set.json", loadStrategy=LoadStrategyEnum.INSERT)
~~~~

### Verifying Database

Asserting database state directly from test code can imply a huge
amount of work. By using *@ShouldMatchDataSet* on a test method,
*NoSQLUnit* will check whether the database contains the expected
entries after test execution. With the *@ShouldMatchDataSet* annotation
you can define a classpath file location, or use withSelectiveMatcher
(see the Advanced Usage chapter for more information).
If no dataset is supplied, the following convention is used:

- First, a file is searched on the classpath, in the same package as the
test class, named `[test class name]#[test method name]-expected.[format]`
(only if the annotation is present on the test method).

- If the first rule does not match, or the annotation is defined at
class scope, a file named `[test class name]-expected.[default format]`
is searched on the classpath, in the same package as the test class.

> **Warning**
>
> Datasets must reside on the *classpath*, and their format depends on
> the *NoSQL* vendor.

An example of usage:

~~~~ {.java}
@ShouldMatchDataSet(location="my_expected_data_set.json")
~~~~
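
Both annotations are commonly combined on the same test method. A minimal sketch (file names are illustrative):

~~~~ {.java}
@Test
@UsingDataSet(locations = "initial_books.json", loadStrategy = LoadStrategyEnum.CLEAN_INSERT)
@ShouldMatchDataSet(location = "expected_books.json")
public void book_should_be_persisted() {
    // exercise the persistence layer here; after the test body returns,
    // NoSQLUnit compares the database contents with expected_books.json
}
~~~~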

MongoDB Engine
==============

MongoDB
=======

MongoDB is a *NoSQL* database that stores structured data as *JSON-like*
documents with dynamic schemas.

**NoSQLUnit** supports *MongoDB* through the following classes:

----------- -----------------------------------------------------
In Memory com.lordofthejars.nosqlunit.mongodb.InMemoryMongoDb
Managed com.lordofthejars.nosqlunit.mongodb.ManagedMongoDb
----------- -----------------------------------------------------

: Lifecycle Management Rules

---------------------- -------------------------------------------------
NoSQLUnit Management com.lordofthejars.nosqlunit.mongodb.MongoDbRule
---------------------- -------------------------------------------------

: Manager Rule

Maven Setup
-----------

To use **NoSQLUnit** with MongoDb you only need to add the following dependency:

~~~~ {.xml}
<dependency>
    <groupId>com.lordofthejars</groupId>
    <artifactId>nosqlunit-mongodb</artifactId>
    <version>${version.nosqlunit}</version>
</dependency>
~~~~

Note that if you are planning to use the **in-memory** approach, it is implemented using
[Fongo](https://github.com/fakemongo/fongo). *Fongo* is a project that helps with unit testing
Java-based MongoDb applications.
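
As a quick illustration of what *Fongo* provides (a minimal sketch; the package name follows the current Fongo project):

~~~~ {.java}
import com.github.fakemongo.Fongo;
import com.mongodb.DB;

// Fongo is a pure in-memory implementation of the Mongo Java API:
// no network, no filesystem, so unit tests stay fast.
Fongo fongo = new Fongo("InMemoryMongo");
DB db = fongo.getDB("test");
~~~~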

Dataset Format
--------------

The default dataset file format in the *MongoDB* module is *json*.

Datasets must have the following [format](#ex.mongodb_dataset):

~~~~ {.json}
{
    "name_collection1": [
        {
            "attribute_1": "value1",
            "attribute_2": "value2"
        },
        {
            "attribute_3": 2,
            "attribute_4": "value4"
        }
    ],
    "name_collection2": [
        ...
    ],
    ....
}
~~~~

Notice that if an attribute value is an integer, double quotes are not
required.

> If you want to use the ISODate function or any other JavaScript function, you should check how the *MongoDB* Java Driver deals with it. For example, in the case of ISODate:

~~~~ {.json}
"bornAt":{ "$date" : "2011-01-05T10:09:15.210Z"}
~~~~
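
A complete dataset entry for a collection with a date field might then look like this (collection and field names are illustrative):

~~~~ {.json}
{
    "users": [
        {
            "name": "alex",
            "bornAt": { "$date": "2011-01-05T10:09:15.210Z" }
        }
    ]
}
~~~~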

Recent versions of **MongoDB** also provide index support, allowing developers to define indexes over document properties. For more information visit [MongoDB](http://docs.mongodb.org/manual/core/indexes/). To support this, the dataset format has been extended so we can define indexes too.

~~~~ {.json}
{
    "collection1": {
        "indexes": [
            {
                "index": {
                    "code": 1
                }
            }
        ],
        "data": [
            {
                "id": 1,
                "code": "JSON dataset"
            },
            {
                "id": 2,
                "code": "Another row"
            }
        ]
    }
}
~~~~

Note that we define the collection name and then two subdocuments. The first one defines an array of indexes, all of them related to the given collection, specifying which fields are to be indexed (the same document form as in the **MongoDB** index specification). The second, the data property, defines all documents that go into the collection under test.

Getting Started
---------------

### Lifecycle Management Strategy

The first step is defining which lifecycle management strategy is
required for your tests. Depending on the kind of test you are
implementing (unit test, integration test, deployment test, ...) you
will require an **in-memory**, **managed** or **remote** approach.

To configure the **in-memory** approach you only need to instantiate the
following [rule](#program.inmemory_conf):

~~~~ {.java}
import static com.lordofthejars.nosqlunit.mongodb.InMemoryMongoDb.InMemoryMongoRuleBuilder.newInMemoryMongoDbRule;

@ClassRule
public static InMemoryMongoDb inMemoryMongoDb = newInMemoryMongoDbRule().build();
~~~~

To configure the **managed** way, you should use ManagedMongoDb rule and
may require some [configuration](#program.managed_conf) parameters.

~~~~ {.java}
import static com.lordofthejars.nosqlunit.mongodb.ManagedMongoDb.MongoServerRuleBuilder.newManagedMongoDbRule;

@ClassRule
public static ManagedMongoDb managedMongoDb = newManagedMongoDbRule().build();
~~~~

By default the managed *MongoDB* rule uses the following default values:

- The *MongoDB* installation directory is retrieved from the `MONGO_HOME`
system environment variable.

- The target path, that is, the directory where the *MongoDb* server is
started, is `target/mongo-temp`.

- The database path is `{target path}/mongo-dbpath`.

- Because all generated data is removed after test execution, the log
file generated by the server remains at `{target path}/logpath`.

- On *Windows* systems the executable is expected at `bin/mongod.exe`,
while on *Mac OS* and *\*nix* it is expected at `bin/mongod`.

- No journaling.

A ManagedMongoDb can be created from scratch, but to make life easier,
a *DSL* is provided through the MongoServerRuleBuilder class. For
[example](#program.managed_specific_conf):

~~~~ {.java}
import static com.lordofthejars.nosqlunit.mongodb.ManagedMongoDb.MongoServerRuleBuilder.newManagedMongoDbRule;

@ClassRule
public static ManagedMongoDb managedMongoDb =
newManagedMongoDbRule().mongodPath("/opt/mongo").appendSingleCommandLineArguments("-vvv").build();
~~~~

In the [example](#program.managed_specific_conf) we are overriding the
`MONGO_HOME` variable (in case it has been set) and setting the mongo
home to `/opt/mongo`. Moreover we are appending a single argument to the
*MongoDB* executable, in this case raising the log level (-vvv). You can
also append *property=value* arguments using the
`appendCommandLineArguments(String argumentName, String argumentValue)`
method, as sketched after the warning below.

> **Warning**
>
> When specifying command line arguments, remember to add the dash
> (-) or double dash (--) where necessary.
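
For instance, a property=value argument could be appended like this (a minimal sketch; the mongod flag and value are illustrative):

~~~~ {.java}
@ClassRule
public static ManagedMongoDb managedMongoDb =
        newManagedMongoDbRule().appendCommandLineArguments("--logpath", "/tmp/mongod.log").build();
~~~~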

To stop the *MongoDB* instance, **NoSQLUnit** sends a `shutdown` command
to the server using the *Java Mongo API*.

Configuring the **remote** approach does not require any special rule
because you (or a system like Maven) are responsible for starting and
stopping the server. This mode is used in deployment tests where you are
testing your application in a real environment.

### Configuring MongoDB Connection

The next step is configuring the ***MongoDB*** rule in charge of keeping
the *MongoDB* database in a known state by inserting and deleting the
defined datasets. You must register the MongoDbRule *JUnit* rule class,
which requires a configuration parameter with information such as host,
port or database name.

To make the developer's life easier and the code more readable, a fluent
interface can be used to create these configuration objects. Two
different kinds of configuration builders exist.

The first one configures a connection to the in-memory *Fongo*
server. For almost all cases the default parameters are enough.

~~~~ {.java}
import static com.lordofthejars.nosqlunit.mongodb.MongoDbRule.MongoDbRuleBuilder.newMongoDbRule;

@Rule
public MongoDbRule embeddedMongoDbRule = newMongoDbRule().defaultEmbeddedMongoDb("test");
~~~~

The second one configures a connection to a managed/remote *MongoDB*
server. Default values are:

---------------- -------------------------------
Host localhost
Port 27017
Authentication No authentication parameters.
---------------- -------------------------------

: Default Managed Configuration Values

~~~~ {.java}
import static com.lordofthejars.nosqlunit.mongodb.MongoDbConfigurationBuilder.mongoDb;

@Rule
public MongoDbRule remoteMongoDbRule = new MongoDbRule(mongoDb().databaseName("test").build());
~~~~

~~~~ {.java}
import static com.lordofthejars.nosqlunit.mongodb.MongoDbConfigurationBuilder.mongoDb;

@Rule
public MongoDbRule remoteMongoDbRule = new MongoDbRule(mongoDb().databaseName("test").host("my_remote_host").build());
~~~~

But we can also define it to use a Spring Data MongoDB defined instance.

If you are planning to use **Spring Data MongoDB**, you may need to use the *Mongo* instance defined within the Spring application context, typically because you are defining an embedded connection using **Fongo**:

~~~~ {.xml}
<!-- Illustrative reconstruction: the original snippet was lost during
     extraction; a typical Fongo bean definition looks like this. -->
<bean name="fongo" class="com.github.fakemongo.Fongo">
    <constructor-arg value="InMemoryMongo" />
</bean>
<bean id="mongo" factory-bean="fongo" factory-method="getMongo" />

<mongo:db-factory id="mongoDbFactory" mongo-ref="mongo" />
~~~~

In these cases you should use a special method which retrieves the *Mongo* (**Fongo**) instance from the application context instead of creating a new one.

~~~~ {.java}
@Autowired
private ApplicationContext applicationContext;

@Rule
public MongoDbRule mongoDbRule = newMongoDbRule().defaultSpringMongoDb("test");
~~~~

> Note that you need to autowire the application context, so **NoSQLUnit** can inject instance defined within application context into *MongoDbRule*.

### Complete Example

Consider a library application which, among other operations, allows us
to add new books to the system. Our [model](#example.book_model) is as
simple as:

~~~~ {.java}
public class Book {

    private String title;

    private int numberOfPages;

    public Book(String title, int numberOfPages) {
        super();
        this.title = title;
        this.numberOfPages = numberOfPages;
    }

    public void setTitle(String title) {
        this.title = title;
    }

    public void setNumberOfPages(int numberOfPages) {
        this.numberOfPages = numberOfPages;
    }

    public String getTitle() {
        return title;
    }

    public int getNumberOfPages() {
        return numberOfPages;
    }
}
~~~~

The next business [class](#example.book_manager) is responsible for
managing access to the *MongoDb* server:

~~~~ {.java}
public class BookManager {

    private static final Logger LOGGER = LoggerFactory.getLogger(BookManager.class);

    private static final MongoDbBookConverter MONGO_DB_BOOK_CONVERTER = new MongoDbBookConverter();
    private static final DbObjectBookConverter DB_OBJECT_BOOK_CONVERTER = new DbObjectBookConverter();

    private DBCollection booksCollection;

    public BookManager(DBCollection booksCollection) {
        this.booksCollection = booksCollection;
    }

    public void create(Book book) {
        DBObject dbObject = MONGO_DB_BOOK_CONVERTER.convert(book);
        booksCollection.insert(dbObject);
    }
}
~~~~

And now it is time for testing. In the next
[test](#example.test_insert_book) we are going to validate that a book
is inserted correctly into the database.

~~~~ {.java}
package com.lordofthejars.nosqlunit.demo.mongodb;

public class WhenANewBookIsCreated {

    @ClassRule
    public static ManagedMongoDb managedMongoDb = newManagedMongoDbRule().mongodPath("/opt/mongo").build();

    @Rule
    public MongoDbRule remoteMongoDbRule = new MongoDbRule(mongoDb().databaseName("test").build());

    @Test
    @UsingDataSet(locations="initialData.json", loadStrategy=LoadStrategyEnum.CLEAN_INSERT)
    @ShouldMatchDataSet(location="expectedData.json")
    public void book_should_be_inserted_into_repository() {

        BookManager bookManager = new BookManager(MongoDbUtil.getCollection(Book.class.getSimpleName()));

        Book book = new Book("The Lord Of The Rings", 1299);
        bookManager.create(book);
    }

}
~~~~

In the [previous](#example.test_insert_book) test we have defined that
*MongoDB* will be managed by the test, starting an instance of the
server located at `/opt/mongo`. Moreover we are setting an
[initial](#example.dataset_book) dataset in the file `initialData.json`,
located on the classpath at
`com/lordofthejars/nosqlunit/demo/mongodb/initialData.json`, and an
[expected](#example.expected_dataset_book) dataset called
`expectedData.json`. First the initial dataset, then the expected one:

~~~~ {.json}
{
    "Book": [
        {"title":"The Hobbit","numberOfPages":293}
    ]
}
~~~~

~~~~ {.json}
{
    "Book": [
        {"title":"The Hobbit","numberOfPages":293},
        {"title":"The Lord Of The Rings","numberOfPages":1299}
    ]
}
~~~~

You can see the full example on
[github](https://github.com/lordofthejars/nosql-unit/tree/master/nosqlunit-demo).

Replica Set
-----------
### Introduction

Database replication in **MongoDB** adds redundancy and high availability to the data.
Instead of the traditional master-slave architecture, **MongoDB** implements a _Replica Set_ architecture,
which can be understood as a more sophisticated master-slave replication. For more information about _Replica Sets_ read the
[mongoDB](http://docs.mongodb.org/manual/core/replication/) documentation.

### Set up and Start Replica Set architecture

In **NoSQLUnit** we can define a replica set architecture and start it up, so that our tests are executed against replica set servers instead of a single server. Due to the nature of a replica set system, we can only create a replica set of managed servers.

So let's see how to define an architecture and start all related servers. The main class is *ReplicaSetManagedMongoDb*, which manages the lifecycle of all servers involved in the replica set. To build a *ReplicaSetManagedMongoDb*, the *ReplicaSetBuilder* builder class is provided; it allows us to define the replica set architecture. With it we can set the eligible servers (those that can be primary or secondary), the secondary-only servers, the arbiters and the hidden ones, and configure all of them with attributes like priority or votes, or by setting tags.

So let's see an example where we define two eligible servers and one arbiter in a replica set called rs-test.

~~~~ {.java}
import static com.lordofthejars.nosqlunit.mongodb.replicaset.ReplicaSetBuilder.replicaSet;

@ClassRule
public static ReplicaSetManagedMongoDb replicaSetManagedMongoDb = replicaSet("rs-test")
        .eligible(
                newManagedMongoDbLifecycle().port(27017)
                        .dbRelativePath("rs-0").logRelativePath("log-0")
                        .get())
        .eligible(
                newManagedMongoDbLifecycle().port(27018)
                        .dbRelativePath("rs-1").logRelativePath("log-1")
                        .get())
        .arbiter(
                newManagedMongoDbLifecycle().port(27019)
                        .dbRelativePath("rs-2").logRelativePath("log-2")
                        .get())
        .get();
~~~~

Notice that you must define a different port and a different database path for each server. Also note that *ReplicaSetManagedMongoDb* won't let tests start executing until the whole replica set becomes stable (this can take some minutes).

Then we only have to create a _MongoDbRule_ as usual, which will populate the defined data into the replica set servers. For this case a new configuration builder is provided that allows us to define the mongo server locations and the write concern used during the seeding phase. By default the _Acknowledged_ write concern is used.

~~~~ {.java}
import static com.lordofthejars.nosqlunit.mongodb.ReplicationMongoDbConfigurationBuilder.replicationMongoDbConfiguration;

@Rule
public MongoDbRule mongoDbRule = newMongoDbRule().configure(
        replicationMongoDbConfiguration().databaseName("test")
                .seed("localhost", 27017)
                .seed("localhost", 27018)
                .configure())
        .build();
~~~~

Now we have configured and deployed a replica set and populated it with the dataset.

**NoSQLUnit** also provides a utility method to cause server failures. It is as easy as calling the _shutdownServer_ method.

~~~~ {.java}
replicaSetManagedMongoDb.shutdownServer(27017);
~~~~

Keep in mind two aspects of using this method:

- Because _@ClassRule_ is used, we are responsible for restarting the system by calling _startServer_.
- The system may become unstable and the Mongo driver can throw many exceptions (that's normal because of the MonitorThread) and even make some tests fail. If you want to wait until all servers become stable again (in real life you won't have this possibility), you can use the following call:

~~~~ {.java}
replicaSetManagedMongoDb.waitUntilReplicaSetBecomesStable();
~~~~

You can also use **NoSQLUnit** to test your replica set deployment on remote servers. You can use MongoDbCommands to retrieve the replica set configuration.

~~~~ {.java}
DBObject replicaSetGetStatus = MongoDbCommands.replicaSetGetStatus(mongoClient);
~~~~

In the previous case replicaSetGetStatus contains a json document with the format described in [MongoDB](http://docs.mongodb.org/manual/reference/replica-status/).

You can see the full example on [github](https://github.com/lordofthejars/nosql-unit/tree/master/nosqlunit-demo).

Sharding
--------

### Introduction

Sharding is another form of replication, but in this case we scale horizontally. MongoDB partitions a collection and stores the different portions on different machines. From a logical point of view the client sees a single database, but internally a cluster of machines is used, with data spread across the whole system.

To run sharding we must set up a sharded cluster. A sharded cluster is composed of the following elements:

- shards, which are _mongod_ instances that hold a portion of the database collections.
- config servers, which store metadata about the cluster.
- mongos servers, which determine the location of the required data among the shards.

Apart from setting up a sharding architecture, we also have to register each shard, enable sharding for the database, enable sharding for each collection we want to partition, and define which element of the document is used to calculate the shard key.

For more information about _Sharding_ read the [mongoDB](http://docs.mongodb.org/manual/sharding/) documentation.

### Set up and Start Sharding

In **NoSQLUnit** we can define a sharding architecture and start it up, so that our tests are executed against it instead of a single server. Due to the nature of a sharding system, we can only create sharding for managed servers.

So let's see how to define an architecture and start all related servers. The main class is *ShardedManagedMongoDb*, which manages the lifecycle of all servers involved in sharding (shards, configs and mongos). To build a *ShardedManagedMongoDb*, the *ShardedGroupBuilder* builder class is provided; it allows us to define each server involved in sharding.

Let's see an example of how to set up and start a system with two shards, one config server and one mongos.

~~~~ {.java}
import static com.lordofthejars.nosqlunit.mongodb.shard.ShardedGroupBuilder.shardedGroup;

@ClassRule
public static ShardedManagedMongoDb shardedManagedMongoDb = shardedGroup()
        .shard(newManagedMongoDbLifecycle().port(27018).dbRelativePath("rs-1").logRelativePath("log-1").get())
        .shard(newManagedMongoDbLifecycle().port(27019).dbRelativePath("rs-2").logRelativePath("log-2").get())
        .config(newManagedMongoDbLifecycle().port(27020).dbRelativePath("rs-3").logRelativePath("log-3").get())
        .mongos(newManagedMongosLifecycle().configServer(27020).get())
        .get();
~~~~

Notice that you must define a different port and a different database path for each server. Also note that in the case of _mongos_ you must set the config server port, and it is not necessary to set a database path.

And finally we only have to create a _MongoDbRule_ as usual, which will populate the defined data into the sharding servers. For this case we must use the same builder used for replica sets, but enabling sharding. Keep in mind that we only have to register the mongos instances, not the shards or config servers.

~~~~ {.java}
@Rule
public MongoDbRule mongoDbRule = newMongoDbRule().configure(
        replicationMongoDbConfiguration().databaseName("test")
                .enableSharding()
                .seed("localhost", 27017)
                .configure())
        .build();
~~~~

And finally the dataset format is extended from the standard one to allow us to define which attributes are used as shard keys. Let's see an example:

~~~~ {.json}
{
    "collection_name": {
        "shard-key-pattern": ["attribute_1", "attribute_2"],
        "data": [
            {"attribute_1":"value_1", "attribute_2":value_2, "attribute_3":"value_3"}
        ]
    }
}
~~~~

For each collection you define which attributes are used to calculate the shard key by using the _shard-key-pattern_ attribute, and with the _data_ attribute we set the whole documents that will be inserted into the collection.

If we use this dataset as an expected dataset, _shard-key-pattern_ is ignored and only the _data_ documents are used for comparison.

Replicated Sharded Cluster
--------------------------

### Introduction

The third way of replication is a hybrid. Each shard contains an n-member replica set, and as with sharding, at least one _config_ server and one _mongos_ server are required.

For more information about _Replicated Sharded Clusters_ read the [mongoDB](http://docs.mongodb.org/manual/tutorial/convert-replica-set-to-replicated-shard-cluster/) documentation.

### Set up and Start Sharding

In **NoSQLUnit** we can define a replicated sharded cluster architecture and start it up, so that our tests are executed against it instead of a single server. Due to the nature of a replicated sharded cluster, we can only create it for managed servers.

So let's see how to define an architecture and start all related servers. The main class is *ShardedManagedMongoDb*, which manages the lifecycle of all servers involved in sharding (shards, configs and mongos). To build a *ShardedManagedMongoDb*, the *ShardedGroupBuilder* builder class is provided; it allows us to define each server involved in sharding, but in contrast to plain sharding, we add a replica set instead of a shard. For this reason *ReplicaSetManagedMongoDb* is also used.

Let's see an example of how to set up two replicated sharded clusters, with one member in each replica set (of course in a production environment you would have more), one config server and one mongos.

~~~~ {.java}
import static com.lordofthejars.nosqlunit.mongodb.shard.ShardedGroupBuilder.shardedGroup;
import static com.lordofthejars.nosqlunit.mongodb.replicaset.ReplicaSetBuilder.replicaSet;

@ClassRule
public static ShardedManagedMongoDb shardedManagedMongoDb = shardedGroup()
        .replicaSet(replicaSet("rs-test-1")
                .eligible(
                        newManagedMongoDbLifecycle()
                                .port(27007).dbRelativePath("rs-0").logRelativePath("log-0")
                                .get()
                )
                .get())
        .replicaSet(replicaSet("rs-test-2")
                .eligible(
                        newManagedMongoDbLifecycle()
                                .port(27009).dbRelativePath("rs-0").logRelativePath("log-0")
                                .get()
                )
                .get())
        .config(newManagedMongoDbLifecycle().port(27020).dbRelativePath("rs-3").logRelativePath("log-3").get())
        .mongos(newManagedMongosLifecycle().configServer(27020).get())
        .get();
~~~~
~~~~

Note that we are using the _replicaSet_ method of _shardedGroup_ to create a replica set inside a shard, and then we use the methods defined in *ReplicaSetBuilder* to configure the replica set.

And finally we only have to create a _MongoDbRule_ as usual, which will populate the defined data into the servers. For replicated sharded clusters we can use the same class and dataset as for sharding.

~~~~ {.java}
@Rule
public MongoDbRule mongoDbRule = newMongoDbRule().configure(
        replicationMongoDbConfiguration().databaseName("test")
                .enableSharding()
                .seed("localhost", 27017)
                .configure())
        .build();
~~~~

and

~~~~ {.json}
{
    "collection_name": {
        "shard-key-pattern": ["attribute_1", "attribute_2"],
        "data": [
            {"attribute_1":"value_1", "attribute_2":value_2, "attribute_3":"value_3"}
        ]
    }
}
~~~~

Neo4j Engine
============

Neo4j
=====

Neo4j is a high-performance, *NoSQL* graph database with all the
features of a mature and robust database.

**NoSQLUnit** supports *Neo4j* through the following classes:

------------------ ------------------------------------------------------------
In Memory com.lordofthejars.nosqlunit.neo4j.InMemoryNeo4j
Embedded com.lordofthejars.nosqlunit.neo4j.EmbeddedNeo4j
Managed Wrapping com.lordofthejars.nosqlunit.neo4j.ManagedWrappingNeoServer
Managed com.lordofthejars.nosqlunit.neo4j.ManagedNeoServer
------------------ ------------------------------------------------------------

: Lifecycle Management Rules

---------------------- ---------------------------------------------
NoSQLUnit Management com.lordofthejars.nosqlunit.neo4j.Neo4jRule
---------------------- ---------------------------------------------

: Manager Rule

Maven Setup
-----------

To use **NoSQLUnit** with Neo4j you only need to add the following dependency:

~~~~ {.xml}
<dependency>
    <groupId>com.lordofthejars</groupId>
    <artifactId>nosqlunit-neo4j</artifactId>
    <version>${version.nosqlunit}</version>
</dependency>
~~~~

Dataset Format
--------------

The default dataset file format in the *Neo4j* module is
[GraphML](http://graphml.graphdrawing.org/). *GraphML* is a
comprehensive and easy-to-use file format for graphs.

Datasets must have the following [format](#ex.neo4j_dataset):

~~~~ {.xml}
<?xml version="1.0" encoding="UTF-8"?>
<!-- Illustrative reconstruction: the markup of the original example was lost
     during extraction; this sketch follows the element descriptions below. -->
<graphml xmlns="http://graphml.graphdrawing.org/xmlns">
    <key id="attr1" for="node" attr.name="attr1" attr.type="string"/>
    <key id="attr2" for="edge" attr.name="attr2" attr.type="float"/>
    <graph id="G" edgedefault="directed">
        <node id="1">
            <data key="attr1">value1</data>
        </node>
        <node id="2">
            <data key="attr1">value2</data>
        </node>
        <edge id="1" source="1" target="2" label="myvalue">
            <data key="attr2">2.0</data>
        </edge>
    </graph>
</graphml>
~~~~

where:

- *graphml*: the root element of the GraphML document.

- *key*: description of graph element properties; you must define
whether the property applies to nodes or relationships, its name, and
the type of the element. In our case string, int, long, float, double
and boolean are supported.

- *graph*: the beginning of the graph representation. In our case
only one level of graph is supported; inner graphs will be
ignored.

- *node*: the beginning of a vertex representation. Please note that
id 0 is reserved for the reference node, so it cannot be used as an id.

- *edge*: the beginning of an edge representation. Source and target
attributes are filled with node ids. If you want to link to the
reference node, use 0, which is the id of the root node. Note that
the label attribute is not defined in the standard GraphML
specification; GraphML supports adding new attributes to all GraphML
elements, and the label attribute has been added to facilitate the
creation of edge labels.

- *data*: the key/value data associated with a graph element. The data
value will be validated against the type defined in the key element.

- *attr.autoindexName*: this attribute is optional and can only be set on a *key* element.
It creates an index with the given name for properties of that type for all nodes or edges.

- *index*: this tag is optional and creates an index with the given name, key and value in
the *node* or *edge* where it is declared.

Getting Started
---------------

### Lifecycle Management Strategy

The first step is defining which lifecycle management strategy is
required for your tests. Depending on the kind of test you are
implementing (unit test, integration test, deployment test, ...) you
will require an in-memory, embedded, managed or remote
approach.

#### In-memory Lifecycle

To configure the **in-memory** approach you only need to instantiate the
following [rule](#program.neo_inmemory_conf):

~~~~ {.java}
import static com.lordofthejars.nosqlunit.neo4j.InMemoryNeo4j.InMemoryNeo4jRuleBuilder.newInMemoryNeo4j;

@ClassRule
public static InMemoryNeo4j inMemoryNeo4j = newInMemoryNeo4j().build();
~~~~

#### Embedded Lifecycle

To configure the **embedded** approach you only need to instantiate the
following [rule](#program.neo_embedded_conf):

~~~~ {.java}
import static com.lordofthejars.nosqlunit.neo4j.EmbeddedNeo4j.EmbeddedNeo4jRuleBuilder.newEmbeddedNeo4jRule;

@ClassRule
public static EmbeddedNeo4j embeddedNeo4j = newEmbeddedNeo4jRule().build();
~~~~

By default the embedded *Neo4j* rule uses the following default values:

------------- ------------------------------------------------------------------------------------
Target path This is the directory where *Neo4j* server is started and is `target/neo4j-temp` .
------------- ------------------------------------------------------------------------------------

: Default Embedded Values

#### Managed Lifecycle

To configure the managed way, two possible approaches can be used:

The first one is using an **embedded database wrapped by a server**.
This is a way to give an embedded database visibility through the
network (internally a WrappingNeoServerBootstrapper instance is created):

~~~~ {.java}
import static com.lordofthejars.nosqlunit.neo4j.ManagedWrappingNeoServer.ManagedWrappingNeoServerRuleBuilder.newWrappingNeoServerNeo4jRule;

@ClassRule
public static ManagedWrappingNeoServer managedWrappingNeoServer = newWrappingNeoServerNeo4jRule().port(8888).build();
~~~~

By default the wrapped managed *Neo4j* rule uses the following default
values, but they can be configured programmatically as shown in the
previous [example](#program.neo_wrapped_managed_conf):

------------- ----------------------------------------------------------------------------
Target path The directory where *Neo4j* server is started and is `target/neo4j-temp` .
Port The port where the server listens for incoming messages is 7474.
------------- ----------------------------------------------------------------------------

: Default Wrapped Values

The second strategy is **starting and stopping an already installed
server** on the executing machine, by calling its start and stop command
lines. The following [rule](#program.neo_managed_conf) should be registered:

~~~~ {.java}
import static com.lordofthejars.nosqlunit.neo4j.ManagedNeoServer.Neo4jServerRuleBuilder.newManagedNeo4jServerRule;

@ClassRule
public static ManagedNeoServer managedNeoServer = newManagedNeo4jServerRule().neo4jPath("/opt/neo4j").build();
~~~~

By default the managed *Neo4j* rule uses the following default values,
but they can be configured programmatically as shown in the previous
[example](#program.neo_managed_conf):

------------- -------------------------------------------------------------------------------------------------------------
Target path This is the directory where *Neo4j* process will be started and by default is `target/neo4j-temp` .
Port The port where the server listens for incoming messages is 7474.
Neo4jPath *Neo4j* installation directory which by default is retrieved from `NEO4J_HOME` system environment variable.
------------- -------------------------------------------------------------------------------------------------------------

: Default Managed Values

> **Warning**
>
> In versions prior to *Neo4j* 1.8 the port cannot be configured from
> the command line and must be changed manually in
> `conf/neo4j-server.properties`. Despite this restriction, if you
> have configured *Neo4j* to run on a different port, it must also be
> specified in the ManagedNeoServer rule.

#### Remote Lifecycle

Configuring the **remote** approach does not require any special rule
because you (or a system like Maven) are responsible for starting and
stopping the server. This mode is used in deployment tests where you are
testing your application in a real environment.

### Configuring Neo4j Connection

The next step is configuring the **Neo4j** rule in charge of keeping the
*Neo4j* graph in a known state by inserting and deleting the defined
datasets. You must register the Neo4jRule *JUnit* rule class, which
requires a configuration parameter with information such as host, port,
uri or target directory.

To make the developer's life easier and the code more readable, a fluent
interface can be used to create these configuration objects. Two
different kinds of configuration builders exist.

#### In-Memory/Embedded Connection

The first one configures a connection to an in-memory/embedded
*Neo4j*.

~~~~ {.java}
import static com.lordofthejars.nosqlunit.neo4j.EmbeddedNeoServerConfigurationBuilder.newEmbeddedNeoServerConfiguration;

@Rule
public Neo4jRule neo4jRule = new Neo4jRule(newEmbeddedNeoServerConfiguration().build());
~~~~

If you are only registering one embedded *Neo4j* instance as in the
previous [example](#program.neo_embedded_conf), calling `build` is
enough. If you are using more than one *Neo4j* embedded connection, as
explained in the [Simultaneous Engine](#advanced.simultaneous-engine-title)
section, a `targetPath` must be provided using the `buildFromTargetPath`
method.

If you are mixing the in-memory approach with the embedded approach, the
target path for the in-memory instance can be found in the
`InMemoryNeo4j.INMEMORY_NEO4J_TARGET_PATH` variable, as the sketch below
shows.
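
A minimal sketch of such a configuration, assuming `buildFromTargetPath` takes the target path directly:

~~~~ {.java}
import static com.lordofthejars.nosqlunit.neo4j.EmbeddedNeoServerConfigurationBuilder.newEmbeddedNeoServerConfiguration;

@Rule
public Neo4jRule neo4jRule = new Neo4jRule(newEmbeddedNeoServerConfiguration()
        .buildFromTargetPath(InMemoryNeo4j.INMEMORY_NEO4J_TARGET_PATH));
~~~~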

#### Managed/Remote Connection

The second one configures a connection to a remote *Neo4j* server
(it is irrelevant at this level whether it is wrapped or not). Default
values are:

---------------- -------------------------------
Connection URI http://localhost:7474/db/data
Authentication No authentication parameters.
---------------- -------------------------------

: Default Managed Connection Values

~~~~ {.java}
import static com.lordofthejars.nosqlunit.neo4j.ManagedNeoServerConfigurationBuilder.newManagedNeoServerConfiguration;

@Rule
public Neo4jRule neo4jRule = new Neo4jRule(newManagedNeoServerConfiguration().build());
~~~~

or you can use the fast way:

~~~~ {.java}
@Rule
public Neo4jRule neo4jRule = newNeo4jRule().defaultManagedNeo4j();
~~~~

### Spring Connection

If you are planning to use **Spring Data Neo4j**, you may need to use the *GraphDatabaseService* defined within the Spring application context, typically because you are defining an embedded connection using the Spring namespace:

~~~~ {.xml}
<!-- Illustrative reconstruction: the original snippet was lost during
     extraction; a typical embedded definition with the Spring Data Neo4j
     namespace looks like this. -->
<neo4j:config storeDirectory="target/config-test"/>
~~~~

In these cases you should use a special method which retrieves the *GraphDatabaseService* instance from the application context instead of creating a new one.

~~~~ {.java}
@Autowired
private ApplicationContext applicationContext;

@Rule
public Neo4jRule neo4jRule = newNeo4jRule().defaultSpringGraphDatabaseServiceNeo4j();
~~~~

> Note that you need to autowire the application context, so **NoSQLUnit** can inject instance defined within application context into *Neo4jRule*.

### Verifying Graph

@ShouldMatchDataSet is also supported for *Neo4j* graphs, but we should
keep some considerations in mind.

To compare two graphs, the stored graph is exported into
[GraphML](#ex.neo4j_dataset) format and then compared with the expected
*GraphML* using the *XmlUnit* framework. This approach implies two
aspects to be considered. The first one is that even if your graph does
not contain any connection to the reference node, the reference node
will appear too, as a node element with id 0. The other aspect is that
ids are *Neo4j's* internal ids, so when you write the expected file,
remember to follow the same id strategy used by *Neo4j*, so that the id
attribute of each node can be matched correctly with the generated
output. Inserted nodes' ids start from 1 (0 is reserved for the
reference node), while edge ids start from 0.

This way of comparing graphs may change in the future (although this
strategy will always be supported).

As noted in the [verification section](#verifying_database), I find
that using @ShouldMatchDataSet is a bad approach during testing because
test readability is affected negatively. So, as a general guide, my
advice is to avoid using @ShouldMatchDataSet in your tests as much as
possible.

### Full Example

To show how to use **NoSQLUnit** with *Neo4j*, we are going to create a
very simple application that counts Neo's friends.

[MatrixManager](#program.matrix_neo4j_manager) is the business class
responsible for inserting new friends and counting the number of Neo's
friends.

~~~~ {.java}
public class MatrixManager {

    public enum RelTypes implements RelationshipType {
        NEO_NODE, KNOWS, CODED_BY
    }

    private GraphDatabaseService graphDb;

    public MatrixManager(GraphDatabaseService graphDatabaseService) {
        this.graphDb = graphDatabaseService;
    }

    public int countNeoFriends() {

        Node neoNode = getNeoNode();
        Traverser friendsTraverser = getFriends(neoNode);

        return friendsTraverser.getAllNodes().size();

    }

    public void addNeoFriend(String name, int age) {
        Transaction tx = this.graphDb.beginTx();
        try {
            Node friend = this.graphDb.createNode();
            friend.setProperty("name", name);
            Relationship relationship = getNeoNode().createRelationshipTo(friend, RelTypes.KNOWS);
            relationship.setProperty("age", age);
            tx.success();
        } finally {
            tx.finish();
        }
    }

    private static Traverser getFriends(final Node person) {
        return person.traverse(Order.BREADTH_FIRST, StopEvaluator.END_OF_GRAPH, ReturnableEvaluator.ALL_BUT_START_NODE,
                RelTypes.KNOWS, Direction.OUTGOING);
    }

    private Node getNeoNode() {
        return graphDb.getReferenceNode().getSingleRelationship(RelTypes.NEO_NODE, Direction.OUTGOING).getEndNode();
    }

}
~~~~

And now one unit test and one integration test are written.

For the [unit](#program.matrix_neo4j_unit) test we are going to use the
embedded approach:

~~~~ {.java}
import static org.junit.Assert.assertThat;
import static org.hamcrest.CoreMatchers.is;
import static com.lordofthejars.nosqlunit.neo4j.EmbeddedNeo4j.EmbeddedNeo4jRuleBuilder.newEmbeddedNeo4jRule;
import static com.lordofthejars.nosqlunit.neo4j.EmbeddedNeoServerConfigurationBuilder.newEmbeddedNeoServerConfiguration;

import javax.inject.Inject;

import org.junit.ClassRule;
import org.junit.Rule;
import org.junit.Test;
import org.neo4j.graphdb.GraphDatabaseService;

import com.lordofthejars.nosqlunit.annotation.UsingDataSet;
import com.lordofthejars.nosqlunit.core.LoadStrategyEnum;
import com.lordofthejars.nosqlunit.neo4j.EmbeddedNeo4j;
import com.lordofthejars.nosqlunit.neo4j.Neo4jRule;

public class WhenNeoFriendsAreRequired {

    @ClassRule
    public static EmbeddedNeo4j embeddedNeo4j = newEmbeddedNeo4jRule().build();

    @Rule
    public Neo4jRule neo4jRule = new Neo4jRule(newEmbeddedNeoServerConfiguration().build(), this);

    @Inject
    private GraphDatabaseService graphDatabaseService;

    @Test
    @UsingDataSet(locations="matrix.xml", loadStrategy=LoadStrategyEnum.CLEAN_INSERT)
    public void all_direct_and_indirectly_friends_should_be_counted() {
        MatrixManager matrixManager = new MatrixManager(graphDatabaseService);
        int countNeoFriends = matrixManager.countNeoFriends();
        assertThat(countNeoFriends, is(3));
    }

}
~~~~

And as the [integration test](#program.matrix_neo4j_integration), the
managed one:

~~~~ {.java}
import static com.lordofthejars.nosqlunit.neo4j.ManagedWrappingNeoServer.ManagedWrappingNeoServerRuleBuilder.newWrappingNeoServerNeo4jRule;
import static com.lordofthejars.nosqlunit.neo4j.ManagedNeoServerConfigurationBuilder.newManagedNeoServerConfiguration;

import javax.inject.Inject;

import org.junit.ClassRule;
import org.junit.Rule;
import org.junit.Test;
import org.neo4j.graphdb.GraphDatabaseService;

import com.lordofthejars.nosqlunit.annotation.ShouldMatchDataSet;
import com.lordofthejars.nosqlunit.annotation.UsingDataSet;
import com.lordofthejars.nosqlunit.core.LoadStrategyEnum;
import com.lordofthejars.nosqlunit.neo4j.ManagedWrappingNeoServer;
import com.lordofthejars.nosqlunit.neo4j.Neo4jRule;

public class WhenNeoMeetsANewFriend {

    @ClassRule
    public static ManagedWrappingNeoServer managedWrappingNeoServer = newWrappingNeoServerNeo4jRule().build();

    @Rule
    public Neo4jRule neo4jRule = new Neo4jRule(newManagedNeoServerConfiguration().build(), this);

    @Inject
    private GraphDatabaseService graphDatabaseService;

    @Test
    @UsingDataSet(locations="matrix.xml", loadStrategy=LoadStrategyEnum.CLEAN_INSERT)
    @ShouldMatchDataSet(location="expected-matrix.xml")
    public void friend_should_be_related_into_neo_graph() {

        MatrixManager matrixManager = new MatrixManager(graphDatabaseService);
        matrixManager.addNeoFriend("The Oracle", 4);
    }

}
~~~~

Note that in both cases we are using the same dataset as the initial
state, which looks like:

~~~~ {.xml}
<?xml version="1.0" encoding="UTF-8"?>
<!-- Illustrative reconstruction: the markup of the original dataset was lost
     during extraction; node names and edge ages are from the original, the
     exact edge layout is an assumption consistent with the tests above. -->
<graphml xmlns="http://graphml.graphdrawing.org/xmlns">
    <key id="name" for="node" attr.name="name" attr.type="string"/>
    <key id="age" for="edge" attr.name="age" attr.type="int"/>
    <graph id="G" edgedefault="directed">
        <node id="1">
            <data key="name">Thomas Anderson</data>
        </node>
        <node id="2">
            <data key="name">Trinity</data>
        </node>
        <node id="3">
            <data key="name">Morpheus</data>
        </node>
        <node id="4">
            <data key="name">Agent Smith</data>
        </node>
        <node id="5">
            <data key="name">The Architect</data>
        </node>
        <edge id="0" source="0" target="1" label="NEO_NODE"/>
        <edge id="1" source="1" target="2" label="KNOWS">
            <data key="age">3</data>
        </edge>
        <edge id="2" source="1" target="3" label="KNOWS">
            <data key="age">5</data>
        </edge>
        <edge id="3" source="2" target="3" label="KNOWS">
            <data key="age">18</data>
        </edge>
        <edge id="4" source="3" target="4" label="KNOWS">
            <data key="age">20</data>
        </edge>
        <edge id="5" source="4" target="5" label="CODED_BY">
            <data key="age">20</data>
        </edge>
    </graph>
</graphml>
~~~~

Cassandra Engine
================

Cassandra
=========

Cassandra is a BigTable data model running on an Amazon Dynamo-like
infrastructure.

**NoSQLUnit** supports *Cassandra* through the following classes:

---------- ---------------------------------------------------------
Embedded com.lordofthejars.nosqlunit.cassandra.EmbeddedCassandra
Managed com.lordofthejars.nosqlunit.cassandra.ManagedCassandra
---------- ---------------------------------------------------------

: Lifecycle Management Rules

---------------------- -----------------------------------------------------
NoSQLUnit Management com.lordofthejars.nosqlunit.cassandra.CassandraRule
---------------------- -----------------------------------------------------

: Manager Rule

Maven Setup
-----------

To use **NoSQLUnit** with Cassandra you only need to add the following
dependency:

~~~~ {.xml}
<dependency>
    <groupId>com.lordofthejars</groupId>
    <artifactId>nosqlunit-cassandra</artifactId>
    <version>${version.nosqlunit}</version>
</dependency>
~~~~

Dataset Format
--------------

The default dataset file format in the *Cassandra* module is json. To
make **NoSQLUnit** compatible with the
[Cassandra-Unit](https://github.com/jsevellec/cassandra-unit/) file
format, the DataLoader of the Cassandra-Unit project is used, so the
same json file format is used.

Datasets must have the following [format](#ex.cassandra_dataset):

~~~~ {.json}
{
    "name" : "",
    "replicationFactor" : "",
    "strategy" : "",
    "columnFamilies" : [{
        "name" : "",
        "type" : "",
        "keyType" : "",
        "comparatorType" : "",
        "subComparatorType" : "",
        "defaultColumnValueType" : "",
        "comment" : "",
        "compactionStrategy" : "",
        "compactionStrategyOptions" : [{
            "name" : "",
            "value" : ""
        }],
        "gcGraceSeconds" : "",
        "maxCompactionThreshold" : "",
        "minCompactionThreshold" : "",
        "readRepairChance" : "",
        "replicationOnWrite" : "",
        "columnsMetadata" : [{
            "name" : "",
            "validationClass" : "",
            "indexType" : "",
            "indexName" : ""
        },
        ...
        ],
        "rows" : [{
            "key" : "",
            "columns" : [{
                "name" : "",
                "value" : ""
            },
            ...
            ],
            ...
            // OR
            ...
            "superColumns" : [{
                "name" : "",
                "columns" : [{
                    "name" : "",
                    "value" : ""
                },
                ...
                ]
            },
            ...
            ]
        },
        ...
        ]
    },
    ...
    ]
}
~~~~

See [Cassandra-Unit
Dataset](https://github.com/jsevellec/cassandra-unit/wiki/What-can-you-set-into-a-dataSet)
format for more information.

Getting Started
---------------

### Lifecycle Management Strategy

The first step is defining which lifecycle management strategy is
required for your tests. Depending on the kind of test you are
implementing (unit test, integration test, deployment test, ...) you
will require an embedded, managed or remote approach.

#### Embedded Lifecycle

To configure the **embedded** approach you only need to instantiate the
following [rule](#program.cassandra_embedded_conf):

~~~~ {.java}
import static com.lordofthejars.nosqlunit.cassandra.EmbeddedCassandra.EmbeddedCassandraRuleBuilder.newEmbeddedCassandraRule;

@ClassRule
public static EmbeddedCassandra embeddedCassandraRule = newEmbeddedCassandraRule().build();
~~~~

By default the embedded *Cassandra* rule uses the following default values:

------------------------------ --------------------------------------------------------------------------------------------------------------------------------------------
Target path This is the directory where *Cassandra* server is started and is `target/cassandra-temp` .
Cassandra Configuration File Location of yaml configuration file. By default a configuration file is provided with correct default parameters.
Host localhost
Port By default port 9171 is used. The port cannot be configured, even by providing an alternative Cassandra configuration file.
------------------------------ --------------------------------------------------------------------------------------------------------------------------------------------

: Default Embedded Values

#### Managed Lifecycle

To configure the **managed** approach you only need to instantiate the
following [rule](#program.cassandra_managed_conf):

~~~~ {.java}
import static com.lordofthejars.nosqlunit.cassandra.ManagedCassandra.ManagedCassandraRuleBuilder.newManagedCassandraRule;

@ClassRule
public static ManagedCassandra managedCassandra = newManagedCassandraRule().build();
~~~~

By default the managed *Cassandra* rule uses the following default
values, which can be configured programmatically:

--------------- ------------------------------------------------------------------------------------------------------------------------------
Target path This is the directory where *Cassandra* server is started and is `target/cassandra-temp` .
CassandraPath *Cassandra* installation directory which by default is retrieved from `CASSANDRA_HOME` system environment variable.
Port By default port 9160 is used. If the port is changed in the *Cassandra* configuration file, it should also be configured here.
--------------- ------------------------------------------------------------------------------------------------------------------------------

: Default Managed Values

> **Warning**
>
> To start Cassandra, `java.home` must be set. Normally this variable is
> already configured, so you should not need to do anything.

#### Remote Lifecycle

Configuring **remote** approach does not require any special rule
because you (or System like Maven ) is the responsible of starting and
stopping the server. This mode is used in deployment tests where you are
testing your application on real environment.

### Configuring Cassandra Connection

The next step is configuring the **Cassandra** rule in charge of keeping
the *Cassandra* database in a known state by inserting and deleting the
defined datasets. You must register the CassandraRule *JUnit* rule
class, which requires a configuration parameter with information such as
host, port or cluster name.

To make the developer's life easier and the code more readable, a fluent
interface can be used to create these configuration objects. Three
different kinds of configuration builders exist.

#### Embedded Connection

The first one configures a connection to an embedded *Cassandra*.

~~~~ {.java}
import static com.lordofthejars.nosqlunit.cassandra.EmbeddedCassandraConfigurationBuilder.newEmbeddedCassandraConfiguration;

@Rule
public CassandraRule cassandraRule = new CassandraRule(newEmbeddedCassandraConfiguration().clusterName("Test Cluster").build());
~~~~

Host and port parameters are already configured.

#### Managed Connection

The second one configures a connection to a managed *Cassandra*.

~~~~ {.java}
import static com.lordofthejars.nosqlunit.cassandra.ManagedCassandraConfigurationBuilder.newManagedCassandraConfiguration;

@Rule
public CassandraRule cassandraRule = new CassandraRule(newManagedCassandraConfiguration().clusterName("Test Cluster").build());
~~~~

Host and port parameters are already configured with the default
parameters of the managed lifecycle. If the port is changed, this class
provides a method to set it, as sketched below.
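
A minimal sketch of overriding the port (the `port` method name is an assumption based on the text above, and the port value is illustrative):

~~~~ {.java}
@Rule
public CassandraRule cassandraRule = new CassandraRule(newManagedCassandraConfiguration()
        .clusterName("Test Cluster")
        .port(9161)   // assumed setter for a non-default port
        .build());
~~~~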

#### Remote Connection

The third one configures a connection to a remote *Cassandra*.

~~~~ {.java}
import static com.lordofthejars.nosqlunit.cassandra.RemoteCassandraConfigurationBuilder.newRemoteCassandraConfiguration;

@Rule
public CassandraRule cassandraRule = new CassandraRule(newRemoteCassandraConfiguration().host("192.168.1.1").clusterName("Test Cluster").build());
~~~~

The port parameter is already configured with the default parameter of
the managed lifecycle. If the port is changed, this class provides a
method to set it. Note that the host parameter must be specified in this
case.

### Verifying Data

@ShouldMatchDataSet is also supported for *Cassandra* data, but we
should keep some considerations in mind.

> **Warning**
>
> In NoSQLUnit, expectations can only be used over data, not over
> configuration parameters, so for example fields set in the dataset
> file like compactionStrategy, gcGraceSeconds or maxCompactionThreshold
> are not used. Maybe they will be supported in the future, but for now
> only data (keyspace, column family name, columns, supercolumns, ...)
> are supported.

### Full Example

To show how to use **NoSQLUnit** with *Cassandra*, we are going to
create a very simple application.

[PersonManager](#program.person_cassandra_manager) is the business class
responsible for getting and updating a person's car.

~~~~ {.java}
public class PersonManager {

    private ColumnFamilyTemplate<String, String> template;

    public PersonManager(String clusterName, String keyspaceName, String host) {
        Cluster cluster = HFactory.getOrCreateCluster(clusterName, host);
        Keyspace keyspace = HFactory.createKeyspace(keyspaceName, cluster);

        template = new ThriftColumnFamilyTemplate<String, String>(keyspace,
                "personFamilyName",
                StringSerializer.get(),
                StringSerializer.get());

    }

    public String getCarByPersonName(String name) {
        ColumnFamilyResult<String, String> queryColumns = template.queryColumns(name);
        return queryColumns.getString("car");
    }

    public void updateCarByPersonName(String name, String car) {
        ColumnFamilyUpdater<String, String> createUpdater = template.createUpdater(name);
        createUpdater.setString("car", car);

        template.update(createUpdater);
    }

}
~~~~

And now one unit test and one integration test are written.

For the [unit](#program.person_cassandra_unit) test we are going to use
the embedded approach:

~~~~ {.java}
import static org.junit.Assert.assertThat;
import static org.hamcrest.CoreMatchers.is;

import static com.lordofthejars.nosqlunit.cassandra.EmbeddedCassandra.EmbeddedCassandraRuleBuilder.newEmbeddedCassandraRule;
import static com.lordofthejars.nosqlunit.cassandra.EmbeddedCassandraConfigurationBuilder.newEmbeddedCassandraConfiguration;

import org.junit.ClassRule;
import org.junit.Rule;
import org.junit.Test;

import com.lordofthejars.nosqlunit.annotation.UsingDataSet;
import com.lordofthejars.nosqlunit.cassandra.CassandraRule;
import com.lordofthejars.nosqlunit.cassandra.EmbeddedCassandra;
import com.lordofthejars.nosqlunit.core.LoadStrategyEnum;

public class WhenPersonWantsToKnowItsCar {

    @ClassRule
    public static EmbeddedCassandra embeddedCassandraRule = newEmbeddedCassandraRule().build();

    @Rule
    public CassandraRule cassandraRule = new CassandraRule(newEmbeddedCassandraConfiguration().clusterName("Test Cluster").build());

    @Test
    @UsingDataSet(locations="persons.json", loadStrategy=LoadStrategyEnum.CLEAN_INSERT)
    public void car_should_be_returned() {

        PersonManager personManager = new PersonManager("Test Cluster", "persons", "localhost:9171");
        String car = personManager.getCarByPersonName("mary");

        assertThat(car, is("ford"));

    }

}
~~~~

And as the [integration test](#program.person_cassandra_integration),
the managed one:

~~~~ {.java}
import static com.lordofthejars.nosqlunit.cassandra.ManagedCassandraConfigurationBuilder.newManagedCassandraConfiguration;
import static com.lordofthejars.nosqlunit.cassandra.ManagedCassandra.ManagedCassandraRuleBuilder.newManagedCassandraRule;

import org.junit.ClassRule;
import org.junit.Rule;
import org.junit.Test;

import com.lordofthejars.nosqlunit.annotation.ShouldMatchDataSet;
import com.lordofthejars.nosqlunit.annotation.UsingDataSet;
import com.lordofthejars.nosqlunit.cassandra.CassandraRule;
import com.lordofthejars.nosqlunit.cassandra.ManagedCassandra;
import com.lordofthejars.nosqlunit.core.LoadStrategyEnum;

public class WhenPersonWantsToUpdateItsCar {

    static {
        System.setProperty("CASSANDRA_HOME", "/opt/cassandra");
    }

    @ClassRule
    public static ManagedCassandra managedCassandra = newManagedCassandraRule().build();

    @Rule
    public CassandraRule cassandraRule = new CassandraRule(newManagedCassandraConfiguration().clusterName("Test Cluster").build());

    @Test
    @UsingDataSet(locations="persons.json", loadStrategy=LoadStrategyEnum.CLEAN_INSERT)
    @ShouldMatchDataSet(location="expected-persons.json")
    public void new_car_should_be_updated() {

        PersonManager personManager = new PersonManager("Test Cluster", "persons", "localhost:9171");
        personManager.updateCarByPersonName("john", "opel");

    }

}
~~~~

Note that in both cases we are using the same dataset as the initial
state, which looks like:

~~~~ {.json}
{
    "name" : "persons",
    "columnFamilies" : [{
        "name" : "personFamilyName",
        "keyType" : "UTF8Type",
        "defaultColumnValueType" : "UTF8Type",
        "comparatorType" : "UTF8Type",
        "rows" : [{
            "key" : "john",
            "columns" : [{
                "name" : "age",
                "value" : "22"
            },
            {
                "name" : "car",
                "value" : "toyota"
            }]
        },
        {
            "key" : "mary",
            "columns" : [{
                "name" : "age",
                "value" : "33"
            },
            {
                "name" : "car",
                "value" : "ford"
            }]
        }]
    }]
}
~~~~

Redis Engine
============

Redis
=====

Redis is an open source, advanced key-value store. It is often referred
to as a data structure server since keys can contain strings, hashes,
lists, sets and sorted sets.

**NoSQLUnit** supports *Redis* by using the following classes:

---------- -------------------------------------------------
Managed com.lordofthejars.nosqlunit.redis.ManagedRedis
---------- -------------------------------------------------

: Lifecycle Management Rules

---------------------- ---------------------------------------------
NoSQLUnit Management com.lordofthejars.nosqlunit.redis.RedisRule
---------------------- ---------------------------------------------

: Manager Rule

Maven Setup
-----------

To use **NoSQLUnit** with Redis you only need to add the following dependency:

~~~~ {.xml}
<dependency>
    <groupId>com.lordofthejars</groupId>
    <artifactId>nosqlunit-redis</artifactId>
    <version>${version.nosqlunit}</version>
</dependency>
~~~~

Dataset Format
--------------

Default dataset file format in the *Redis* module is *json*.

Datasets must have the following [format](#ex.redis_dataset):

~~~~ {.json}
{
"data":[
{"simple": [
{
"key":"key1",
"value":"value1"
}
]
},
{"list": [{
"key":"key3",
"values":[
{"value":"value3"},
{"value":"value4"}
]
}]
},

{"sortset": [{
"key":"key4",
"values":[
{"score":2, "value":"value5" },{"score":3, "value":1 }, {"score":1, "value":"value6" }]
}]
},
{"hash": [
{
"key":"user",
"values":[
{"field":"name", "value":"alex"},
{"field":"password", "value":"alex"}
]
}
]
},
{"set":[{
"key":"key3",
"values":[
{"value":"value3"},
{"value":"value4"}
]
}]
}
]
}
~~~~

The root element must be called *data*, and then, depending on the kind of
structured data we need to store, one or more of the following elements
should appear. Note that the *key* field sets the key of the element, and
the *value* field sets a value.

- *simple* : In case we want to store simple key/value elements. This
element will contain an array of key/value entries.

- *list* : In case we want to store a key with a list of values. This
element contains a *key* field for the key name and a *values* field with
an array of values.

- *set* : In case we want to store a key within a set (no duplicates
allowed). The structure is the same as the list element.

- *sortset* : In case we want to store a key within a sorted set. This
element contains the key and an array of values, where each entry, apart
from the *value* field, also contains a *score* field of type Number that
sets the order within the sorted set.

- *hash* : In case we want to store a key within a map of field/value.
In this case the *field* element sets the field name, and *value* sets the
value of that field.

Getting Started
---------------

### Lifecycle Management Strategy

The first step is defining which lifecycle management strategy is required
for your tests. Depending on the kind of test you are implementing (unit
test, integration test, deployment test, ...) you will require a
managed approach or a remote approach.

#### Managed Lifecycle

To configure the **managed** approach you should only instantiate the following
[rule](#program.redis_managed_conf):

~~~~ {.java}
@ClassRule
public static ManagedRedis managedRedis = newManagedRedisRule().redisPath("/opt/redis-2.4.16").build();
~~~~

By default the managed *Redis* rule uses the following default values, but they
can be configured programmatically:

-------------------- -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Target path          This is the directory where the *Redis* server is started; it is `target/redis-temp`.
RedisPath            *Redis* installation directory, which by default is retrieved from the `REDIS_HOME` system environment variable.
Port                 By default port 6379 is used. If the port is changed in the *Redis* configuration file, this port should be configured here too.
Configuration File   By default *Redis* can work with no configuration file (it uses default values), but if you need to start *Redis* with a specific configuration file located in any directory, the file path should be set.
-------------------- -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

: Default Managed Values

#### Remote Lifecycle

Configuring the **remote** approach does not require any special rule
because you (or a system like Maven) are responsible for starting and
stopping the server. This mode is used in deployment tests where you are
testing your application in a real environment.

### Configuring Redis Connection

The next step is configuring the **Redis** rule, in charge of maintaining the
*Redis* store in a known state by inserting and deleting defined datasets. You
must register the RedisRule *JUnit* rule class, which requires a
configuration parameter with information like host, port, or cluster
name.

To make developer's life easier and code more readable, a fluent
interface can be used to create these configuration objects. Three
different kinds of configuration builders exist.

#### Managed Connection

Configuring a connection to managed *Redis* .

~~~~ {.java}
import static com.lordofthejars.nosqlunit.redis.ManagedRedisConfigurationBuilder.newManagedRedisConfiguration;

@Rule
public RedisRule redisRule = new RedisRule(newManagedRedisConfiguration().build());

~~~~

Host and port parameters are already configured with the default parameters
of the managed lifecycle. If the port is changed, this class provides a method
to set it.
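
For example, if the managed instance listens on a non-default port, the configuration could look like this minimal sketch (the `port` builder method is an assumption based on the description above):

~~~~ {.java}
@Rule
public RedisRule redisRule = new RedisRule(newManagedRedisConfiguration().port(6380).build());
~~~~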

#### Remote Connection

Configuring a connection to remote *Redis* .

~~~~ {.java}
import static com.lordofthejars.nosqlunit.redis.RemoteRedisConfigurationBuilder.newRemoteRedisConfiguration;

@Rule
public RedisRule redisRule = new RedisRule(newRemoteRedisConfiguration().host("192.168.1.1").build());
~~~~

The port parameter is already configured with the default value of the managed
lifecycle. If the port is changed, this class provides a method to set it.
Note that the host parameter must be specified in this case.

#### Shard Connection

The Redis connection can also be configured as a shard using ShardedJedis
capabilities.

~~~~ {.java}
import static com.lordofthejars.nosqlunit.redis.ShardedRedisConfigurationBuilder.newShardedRedisConfiguration;

@Rule
public RedisRule redisRule = new RedisRule(newShardedRedisConfiguration()
.shard(host("127.0.0.1"), port(ManagedRedis.DEFAULT_PORT))
.password("a")
.timeout(1000)
.weight(1000)
.shard(host("127.0.0.1"), port(ManagedRedis.DEFAULT_PORT + 1))
.password("b")
.timeout(3000)
.weight(3000)
.build());
~~~~

Note that only host and port are mandatory; the other parameters use default
values.

- *password* : In case the repository is password-protected, this
attribute is used as the password. The default value is null.

- *timeout* : Timeout for the shard. By default the timeout is set to 2
seconds.

- *weight* : The weight of this shard relative to the other ones. By default
it is 1.

### Verifying Data

@ShouldMatchDataSet is also supported for the *Redis* engine.
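
A minimal sketch of an expectation test (the `expected-book.json` dataset file and the injected `jedis` instance are assumptions for illustration):

~~~~ {.java}
@Test
@UsingDataSet(locations="book.json", loadStrategy=LoadStrategyEnum.CLEAN_INSERT)
@ShouldMatchDataSet(location="expected-book.json")
public void inserted_book_should_be_in_store() {
    // after the test body runs, store contents are compared against expected-book.json
    jedis.hset("Dune", "title", "Dune");
}
~~~~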

### Full Example

To show how to use **NoSQLUnit** with *Redis*, we are going to create a
very simple application.

[BookManager](#program.book_redis_manager) is the business class
responsible for inserting new books and finding books by their title.

~~~~ {.java}
public class BookManager {

private static final String TITLE_FIELD_NAME = "title";
private static final String NUMBER_OF_PAGES = "numberOfPages";

private Jedis jedis;

public BookManager(Jedis jedis) {
this.jedis = jedis;
}

public void insertBook(Book book) {

Map<String, String> fields = new HashMap<String, String>();

fields.put(TITLE_FIELD_NAME, book.getTitle());
fields.put(NUMBER_OF_PAGES, Integer.toString(book.getNumberOfPages()));

jedis.hmset(book.getTitle(), fields);
}

public Book findBookByTitle(String title) {

Map<String, String> fields = jedis.hgetAll(title);
return new Book(fields.get(TITLE_FIELD_NAME), Integer.parseInt(fields.get(NUMBER_OF_PAGES)));

}

}
~~~~

And now one integration test is written:

~~~~ {.java}
import static com.lordofthejars.nosqlunit.redis.RedisRule.RedisRuleBuilder.newRedisRule;
import static com.lordofthejars.nosqlunit.redis.ManagedRedis.ManagedRedisRuleBuilder.newManagedRedisRule;

import static org.junit.Assert.assertThat;
import static org.hamcrest.CoreMatchers.is;

import org.junit.ClassRule;
import org.junit.Rule;
import org.junit.Test;

import redis.clients.jedis.Jedis;

import com.lordofthejars.nosqlunit.annotation.UsingDataSet;
import com.lordofthejars.nosqlunit.core.LoadStrategyEnum;
import com.lordofthejars.nosqlunit.demo.model.Book;
import com.lordofthejars.nosqlunit.redis.ManagedRedis;
import com.lordofthejars.nosqlunit.redis.RedisRule;

public class WhenYouFindABook {

static {
System.setProperty("REDIS_HOME", "/opt/redis-2.4.16");
}

@ClassRule
public static ManagedRedis managedRedis = newManagedRedisRule().build();

@Rule
public RedisRule redisRule = newRedisRule().defaultManagedRedis();

@Test
@UsingDataSet(locations="book.json", loadStrategy=LoadStrategyEnum.CLEAN_INSERT)
public void book_should_be_returned_if_title_is_in_database() {

BookManager bookManager = new BookManager(new Jedis("localhost"));
Book findBook = bookManager.findBookByTitle("The Hobbit");

assertThat(findBook, is(new Book("The Hobbit", 293)));

}

}
~~~~

And the dataset used is:

~~~~ {.json}
{
"data":[
{"hash": [
{
"key":"The Hobbit",
"values":[
{"field":"title", "value":"The Hobbit"},
{"field":"numberOfPages", "value":"293"}
]
}
]
}
]
}
~~~~

HBase Engine
============

HBase
=====

*Apache HBase* is an open-source, distributed, versioned, column-oriented
store.

*NoSQLUnit* supports *HBase* by using the following classes:

---------- -------------------------------------------------
Embedded com.lordofthejars.nosqlunit.hbase.EmbeddedHBase
Managed com.lordofthejars.nosqlunit.hbase.ManagedHBase
---------- -------------------------------------------------

: Lifecycle Management Rules

---------------------- ---------------------------------------------
NoSQLUnit Management com.lordofthejars.nosqlunit.hbase.HBaseRule
---------------------- ---------------------------------------------

: Manager Rule

Maven Setup
-----------

To use *NoSQLUnit* with HBase you only need to add the following dependency:

~~~~ {.xml}
<dependency>
    <groupId>com.lordofthejars</groupId>
    <artifactId>nosqlunit-hbase</artifactId>
    <version>${version.nosqlunit}</version>
</dependency>
~~~~

Dataset Format
--------------

Default dataset file format in the *HBase* module is *json*. The HBase dataset
format is the same one used by
[Cassandra-Unit](https://github.com/jsevellec/cassandra-unit/), but not all
fields are supported. Only the fields available in the HBase TSV import
application can be set in the dataset.

So, in summary, datasets must have the following [format](#ex.hbase_dataset):

~~~~ {.json}
{
"name" : "tablename",
"columnFamilies" : [{
"name" : "columnFamilyName",
"rows" : [{
"key" : "key1",
"columns" : [{
"name" : "columnName",
"value" : "columnValue"
},
...
]
},
...
]
},
...
]
}
~~~~

Getting Started
---------------

### Lifecycle Management Strategy

The first step is defining which lifecycle management strategy is required
for your tests. Depending on the kind of test you are implementing (unit
test, integration test, deployment test, ...) you will require an
embedded, a managed or a remote approach.

#### Embedded Lifecycle

To configure the *embedded* approach you should only instantiate the following
[rule](#program.hbase_embedded_conf):

~~~~ {.java}
@ClassRule
public static EmbeddedHBase embeddedHBase = newEmbeddedHBaseRule().build();
~~~~

By default the *Embedded* rule uses the HBaseTestingUtility default
values:

------------------ --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Target path        This is the directory where *HBase* stores data; it is `target/data`.
Host               localhost
Port               By default port 60000 is used.
File Permissions   Depending on your umask configuration, HBaseTestingUtility will create some directories that will not be accessible during runtime. By default this value is set to 775, but depending on your OS you may require a different value.
------------------ --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

: Default Embedded Values

#### Managed Lifecycle

To configure the *managed* approach you should only instantiate the following
[rule](#program.hbase_managed_conf):

~~~~ {.java}
@ClassRule
public static ManagedHBase managedHBase = newManagedHBaseServerRule().build();
~~~~

By default the managed *HBase* rule uses the following default values, but it can be
configured programmatically:

--------------- ---------------------------------------------------------------------------------------------------------------------------
Target path     This is the directory where the *HBase* server is started; it is `target/hbase-temp`.
HBasePath       *HBase* installation directory, which by default is retrieved from the `HBASE_HOME` system environment variable.
Port            By default port 60000 is used. If the port is changed in the *HBase* configuration file, this port should be configured here too.
--------------- ---------------------------------------------------------------------------------------------------------------------------

: Default Managed Values

> **Warning**
>
> To start HBase, `JAVA_HOME` must be set. Normally this variable is
> already configured, so you would not need to do anything.

#### Remote Lifecycle

Configuring the *remote* approach does not require any special rule because
you (or a system like Maven) are responsible for starting and stopping
the server. This mode is used in deployment tests where you are testing
your application in a real environment.

### Configuring HBase Connection

The next step is configuring the *HBase* rule, in charge of maintaining *HBase*
columns in a known state by inserting and deleting defined datasets. You
must register the HBaseRule *JUnit* rule class, which requires a
configuration parameter with some information.

To make developer's life easier and code more readable, a fluent
interface can be used to create these configuration objects. Three
different kinds of configuration builders exist.

#### Embedded Connection

The first one is for configuring a connection to embedded *HBase* .

~~~~ {.java}
import static com.lordofthejars.nosqlunit.hbase.HBaseRule.HBaseRuleBuilder.newHBaseRule;

@Rule
public HBaseRule hBaseRule = newHBaseRule().defaultEmbeddedHBase();
~~~~

Embedded HBase does not require any special parameter. The configuration
object is copied from the Embedded rule directly to the HBaseRule.

#### Managed Connection

This is for configuring a connection to managed *HBase* .

~~~~ {.java}
import static com.lordofthejars.nosqlunit.hbase.ManagedHBaseConfigurationBuilder.newManagedHBaseConfiguration;

@Rule
public HBaseRule hbaseRule = new HBaseRule(newManagedHBaseConfiguration().build());
~~~~

By default the configuration used is the one loaded by calling the
[HBaseConfiguration.create()](http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/HBaseConfiguration.html#create())
method, which uses the hbase-site.xml and hbase-default.xml classpath files.

A `setProperty` method is also provided to modify any parameter of the
generated configuration object.
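
For example, a minimal sketch (assuming `setProperty` returns the builder so calls can be chained):

~~~~ {.java}
@Rule
public HBaseRule hbaseRule = new HBaseRule(newManagedHBaseConfiguration()
    .setProperty("hbase.zookeeper.property.clientPort", "2181")
    .build());
~~~~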

#### Remote Connection

Configuring a connection to a remote *HBase* uses the same approach as the
ManagedHBase configuration object, but using the
com.lordofthejars.nosqlunit.hbase.RemoteHBaseConfigurationBuilder class
instead of
com.lordofthejars.nosqlunit.hbase.ManagedHBaseConfigurationBuilder.
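
A minimal sketch, assuming the remote builder follows the same naming convention as the managed one:

~~~~ {.java}
import static com.lordofthejars.nosqlunit.hbase.RemoteHBaseConfigurationBuilder.newRemoteHBaseConfiguration;

@Rule
public HBaseRule hbaseRule = new HBaseRule(newRemoteHBaseConfiguration().build());
~~~~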

> **Warning**
>
> Working with Apache HBase required a bit of knowledge about how it
> works. For example your /etc/hosts file cannot contain a reference to
> your host name with ip 127.0.1.1.
>
> Moreover *NoSQLUnit* uses *HBase-0.94.1* and this version should be
> also installed in your computer to work with managed or remote approach. If you
> install another version, you should exclude these artifacts from
> *NoSQLUnit* dependencies, and add the new ones manually to your pom
> file.

### Verifying Data

@ShouldMatchDataSet is also supported for *HBase* data, but we should
keep some considerations in mind.

If you plan to verify data with @ShouldMatchDataSet in the managed and
remote approaches, you should enable the Aggregate coprocessor by editing
the hbase-site.xml file and adding the following lines:

~~~~ {.xml}
<property>
    <name>hbase.coprocessor.user.region.classes</name>
    <value>org.apache.hadoop.hbase.coprocessor.AggregateImplementation</value>
</property>
~~~~

### Full Example

To show how to use *NoSQLUnit* with *HBase*, we are going to create a
very simple application.

[PersonManager](#program.person_hbase_manager) is the business class
responsible for getting and updating a person's car.

~~~~ {.java}
public class PersonManager {

private Configuration configuration;

public PersonManager(Configuration configuration) {
this.configuration = configuration;
}

public String getCarByPersonName(String personName) throws IOException {
HTable table = new HTable(configuration, "person");
Get get = new Get(personName.getBytes());
Result result = table.get(get);

return new String(result.getValue(toByteArray().convert("personFamilyName"), toByteArray().convert("car")));
}

private Converter toByteArray() {
return new Converter() {

@Override
public byte[] convert(String element) {
return element.getBytes();
}
};
}

}
~~~~

And now one unit test is written:

For [unit](#program.person_hbase_unit) test we are going to use embedded
approach:

~~~~ {.java}
public class WhenPersonWantsToKnowItsCar {

@ClassRule
public static EmbeddedHBase embeddedHBase = newEmbeddedHBaseRule().build();

@Rule
public HBaseRule hBaseRule = newHBaseRule().defaultEmbeddedHBase(this);

@Inject
private Configuration configuration;

@Test
@UsingDataSet(locations="persons.json", loadStrategy=LoadStrategyEnum.CLEAN_INSERT)
public void car_should_be_returned() throws IOException {

PersonManager personManager = new PersonManager(configuration);
String car = personManager.getCarByPersonName("john");

assertThat(car, is("toyota"));
}

}
~~~~

And the dataset used is:

~~~~ {.json}
{
"name" : "person",
"columnFamilies" : [{
"name" : "personFamilyName",
"rows" : [{
"key" : "john",
"columns" : [{
"name" : "age",
"value" : "22"
},
{
"name" : "car",
"value" : "toyota"
}]
},
{
"key" : "mary",
"columns" : [{
"name" : "age",
"value" : "33"
},
{
"name" : "car",
"value" : "ford"
}]
}]
}]
}
~~~~

CouchDB Engine
==============

CouchDB
=======

*CouchDB* is a *NoSQL* database that stores structured data as *JSON-like*
documents with dynamic schemas.

**NoSQLUnit** supports *CouchDB* by using the following classes:

--------- ----------------------------------------------------
Managed com.lordofthejars.nosqlunit.couchdb.ManagedCouchDb
--------- ----------------------------------------------------

: Lifecycle Management Rules

---------------------- -------------------------------------------------
NoSQLUnit Management com.lordofthejars.nosqlunit.couchdb.CouchDbRule
---------------------- -------------------------------------------------

: Manager Rule

Maven Setup
-----------

To use **NoSQLUnit** with CouchDB you only need to add the following dependency:

~~~~ {.xml}
<dependency>
    <groupId>com.lordofthejars</groupId>
    <artifactId>nosqlunit-couchdb</artifactId>
    <version>${version.nosqlunit}</version>
</dependency>
~~~~

Dataset Format
--------------

Default dataset file format in the *CouchDB* module is *json*.

Datasets must have the following [format](#ex.couchdb_dataset):

~~~~ {.json}
{
"data":
[
{"attribute1":"value1", "atribute2":"value2", ...},
{...}
]
}
~~~~

Notice that if attribute values are integers, double quotes are not
required.

Getting Started
---------------

### Lifecycle Management Strategy

The first step is defining which lifecycle management strategy is required
for your tests. Depending on the kind of test you are implementing (unit
test, integration test, deployment test, ...) you will require a
**managed** approach or a **remote** approach.

There is no *CouchDB* in-memory instance, so only the managed or remote
lifecycle can be used.

To configure the **managed** way, you should use the ManagedCouchDb rule, which
may require some [configuration](#program.couchdb_managed_conf)
parameters.

~~~~ {.java}
import static com.lordofthejars.nosqlunit.couchdb.ManagedCouchDb.ManagedCouchDbRuleBuilder.newManagedCouchDbRule;

@ClassRule
public static ManagedCouchDb managedCouchDb = newManagedCouchDbRule().couchDbPath("/usr/local").build();
~~~~

By default the managed *CouchDB* rule uses the following default values:

- *CouchDB* installation directory is retrieved from `COUCHDB_HOME`
system environment variable.

- Target path, that is the directory where *CouchDB* server is
started, is `target/couchdb-temp` .

- Port where *CouchDB* will be started. Note that this parameter is
used only as information; if you change the port in the configuration file
you should change this parameter too. By default the *CouchDB* server is
started at port `5984`.

Configuring the **remote** approach does not require any special rule
because you (or a system like Maven) are responsible for starting and
stopping the server. This mode is used in deployment tests where you are
testing your application in a real environment.

### Configuring CouchDB Connection

The next step is configuring the ***CouchDB*** rule, in charge of maintaining
the *CouchDB* database in a known state by inserting and deleting defined
datasets. You must register the CouchDbRule *JUnit* rule class, which
requires a configuration parameter with information like host, port or
database name.

To make developer's life easier and code more readable, a fluent
interface can be used to create these configuration objects.

---------------------- -------------------------------
URI                    http://localhost:5984
Authentication         No authentication parameters.
Enable SSL             false.
Relaxed SSL Settings   false.
Caching                True.
---------------------- -------------------------------

: Default Managed Configuration Values

~~~~ {.java}
import static com.lordofthejars.nosqlunit.couchdb.CouchDbRule.CouchDbRuleBuilder.newCouchDbRule;

@Rule
public CouchDbRule couchDbRule = newCouchDbRule().defaultManagedCouchDb("books");
~~~~

### Complete Example

Consider a library application which, apart from multiple operations,
allows us to add new books to the system. Our
[model](#example.couchdb_book_model) is as simple as:

~~~~ {.java}
public class Book {

private String title;

private int numberOfPages;

public Book(String title, int numberOfPages) {
super();
this.title = title;
this.numberOfPages = numberOfPages;
}

public void setTitle(String title) {
this.title = title;
}

public void setNumberOfPages(int numberOfPages) {
this.numberOfPages = numberOfPages;
}

public String getTitle() {
return title;
}

public int getNumberOfPages() {
return numberOfPages;
}
}
~~~~

The next business [class](#example.couchdb_book_manager) is responsible
for managing access to the *CouchDB* server:

~~~~ {.java}
public class BookManager {

private CouchDbConnector connector;

public BookManager(CouchDbConnector connector) {
this.connector = connector;
}

public void create(Book book) {
connector.create(MapBookConverter.toMap(book));
}

public Book findBookById(String id) {
Map map = connector.get(Map.class, id);
return MapBookConverter.toBook(map);
}

}
~~~~
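
The MapBookConverter helper referenced above is not listed here; a minimal sketch, assuming books are mapped by their *title* and *numberOfPages* properties, could look like:

~~~~ {.java}
import java.util.HashMap;
import java.util.Map;

public class MapBookConverter {

    public static Map<String, Object> toMap(Book book) {
        Map<String, Object> map = new HashMap<String, Object>();
        map.put("title", book.getTitle());
        map.put("numberOfPages", book.getNumberOfPages());
        return map;
    }

    public static Book toBook(Map<String, Object> map) {
        // numberOfPages may arrive as a String or a Number depending on the dataset
        return new Book((String) map.get("title"),
                Integer.parseInt(map.get("numberOfPages").toString()));
    }
}
~~~~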

And now it is time for testing. In the next
[test](#example.couchdb_test_find_book) we are going to validate that a
book is found in the database.

~~~~ {.java}
public class WhenYouFindBooksById {

@ClassRule
public static ManagedCouchDb managedCouchDb = newManagedCouchDbRule().couchDbPath("/usr/local").build();

@Rule
public CouchDbRule couchDbRule = newCouchDbRule().defaultManagedCouchDb("books");

@Inject
private CouchDbConnector couchDbConnector;

@Test
@UsingDataSet(locations="books.json", loadStrategy=LoadStrategyEnum.CLEAN_INSERT)
public void identified_book_should_be_returned() {

BookManager bookManager = new BookManager(couchDbConnector);
Book book = bookManager.findBookById("1");

assertThat(book.getTitle(), is("The Hobbit"));
assertThat(book.getNumberOfPages(), is(293));

}

}
~~~~

And the dataset used is:

~~~~ {.json}
{
"data":
[
{"_id":"1", "title":"The Hobbit","numberOfPages":"293"}
]
}
~~~~

You can see the full example at
[github](https://github.com/lordofthejars/nosql-unit/tree/master/nosqlunit-demo)
.

Infinispan Engine
============

Infinispan
=====

Infinispan is an open-source transactional in-memory key/value NoSQL datastore & Data Grid.

*NoSQLUnit* supports *Infinispan* by using the following classes:

---------- -------------------------------------------------
Embedded com.lordofthejars.nosqlunit.infinispan.EmbeddedInfinispan
Managed com.lordofthejars.nosqlunit.infinispan.ManagedInfinispan
---------- -------------------------------------------------

: Lifecycle Management Rules

---------------------- ---------------------------------------------
NoSQLUnit Management com.lordofthejars.nosqlunit.infinispan.InfinispanRule
---------------------- ---------------------------------------------

: Manager Rule

Maven Setup
-----------

To use *NoSQLUnit* with Infinispan you only need to add the following dependency:

~~~~ {.xml}
<dependency>
    <groupId>com.lordofthejars</groupId>
    <artifactId>nosqlunit-infinispan</artifactId>
    <version>${version.nosqlunit}</version>
</dependency>
~~~~

Dataset Format
--------------

Default dataset file format in the *Infinispan* module is *json*. With this dataset you can define the key and the value that will be inserted into *Infinispan*. The value can be a simple type like Integer, String, ..., a collection type like set and list implementations, or an object (using default Jackson rules, no annotations required).

So, in summary, datasets must have the following [format](#ex.infinispan_dataset):

~~~~ {.json}
{
"data": [
{
"key":"alex",
"implementation":"com.lordofthejars.nosqlunit.demo.infinispan.User",
"value": {
"name":"alex",
"age":32
}
},
{
"key":"key1",
"value":1
},
{
"key":"key2",
"implementation":"java.util.HashSet",
"value": [{"value":"a"},{"value":"b"}]
}

]
}
~~~~

Note that the first key is inserting an object. You should set its implementation, and set the object properties in json format so Jackson can create the required object. The User object only contains getters and setters for its properties.
The second key is a simple key, in this case an integer.
The third one is a set of strings. See that we must provide the implementation of the collection or an *ArrayList* will be used by default. You can also define objects instead of simple types.

Getting Started
---------------

### Lifecycle Management Strategy

The first step is defining which lifecycle management strategy is required
for your tests. Depending on the kind of test you are implementing (unit
test, integration test, deployment test, ...) you will require an
embedded, a managed or a remote approach.

#### Embedded Lifecycle

To configure the *embedded* approach you should only instantiate the following
[rule](#program.infinispan_embedded_conf):

~~~~ {.java}
@ClassRule
public static final EmbeddedInfinispan EMBEDDED_INFINISPAN = newEmbeddedInfinispanRule().build();
~~~~

By default the *Embedded* rule uses an EmbeddedCacheManager with default
values:

------------------ --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Target path        This is the directory used for starting Embedded Infinispan; by default it is `target/infinispan-test-data/impermanent-db`.
Configuration File Configuration file used by Infinispan for configuring the grid. By default no configuration file is provided and the default *Infinispan* internal values are used.
------------------ --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

: Default Embedded Values

#### Managed Lifecycle

To configure the *managed* approach you should only instantiate the following
[rule](#program.infinispan_managed_conf):

~~~~ {.java}
@ClassRule
public static ManagedInfinispan managedInfinispan = newManagedInfinispanRule().infinispanPath("/opt/infinispan-5.1.6").build();
~~~~

By default the managed *Infinispan* rule uses the following default values, but it can be
configured programmatically:

--------------- ---------------------------------------------------------------------------------------------------------------------------
Target path     This is the directory where the *Infinispan* server is started; it is `target/infinispan-temp`.
InfinispanPath  *Infinispan* installation directory, which by default is retrieved from the `INFINISPAN_HOME` system environment variable.
Port            By default port 11222 is used.
Protocol        By default hotrod is used; internally **NoSQLUnit** uses hotrod too, so it is desirable not to change it.
--------------- ---------------------------------------------------------------------------------------------------------------------------

: Default Managed Values

#### Remote Lifecycle

Configuring the *remote* approach does not require any special rule because
you (or a system like Maven) are responsible for starting and stopping
the server. This mode is used in deployment tests where you are testing
your application in a real environment.

### Configuring Infinispan Connection

The next step is configuring the *Infinispan* rule, in charge of maintaining
the *Infinispan* cache in a known state by inserting and deleting defined
datasets. You must register the InfinispanRule *JUnit* rule class, which
requires a configuration parameter with some information.

To make developer's life easier and code more readable, a fluent
interface can be used to create these configuration objects. Three
different kinds of configuration builders exist.

#### Embedded Connection

The first one is for configuring a connection to embedded *Infinispan* .

~~~~ {.java}
import static com.lordofthejars.nosqlunit.infinispan.InfinispanRule.InfinispanRuleBuilder.newInfinispanRule;

@Rule
public InfinispanRule infinispanRule = newInfinispanRule().defaultEmbeddedInfinispan();
~~~~

Embedded Infinispan does not require any special parameter, but you can use the com.lordofthejars.nosqlunit.infinispan.EmbeddedInfinispanConfigurationBuilder class for creating a custom configuration object to set the cache name.

#### Managed Connection

This is for configuring a connection to managed *Infinispan* .

~~~~ {.java}
import static com.lordofthejars.nosqlunit.infinispan.ManagedInfinispanConfigurationBuilder.newManagedInfinispanConfiguration;

@Rule
public InfinispanRule infinispanRule = newInfinispanRule().configure(newManagedInfinispanConfiguration().build()).build();
~~~~

By default port 11222 is used, together with the default configuration provided by *Infinispan*. You can also set the configuration properties (used by the hotrod client) and the cache name.

#### Remote Connection

Configuring a connection to a remote *Infinispan* uses the same approach as the
ManagedInfinispan configuration object, but using the
com.lordofthejars.nosqlunit.infinispan.RemoteInfinispanConfigurationBuilder class.

### Verifying Data

@ShouldMatchDataSet is also supported for *Infinispan* data, but we should
keep one consideration in mind.

When data is verified with @ShouldMatchDataSet, the *equals* method of your POJOs is used, so implement it accordingly.
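
For example, the User class used in this section's datasets would need something along these lines (a minimal sketch):

~~~~ {.java}
public class User {

    private String name;
    private int age;

    public User() {
    }

    public User(String name, int age) {
        this.name = name;
        this.age = age;
    }

    // getters and setters omitted for brevity

    @Override
    public boolean equals(Object o) {
        if (this == o) return true;
        if (!(o instanceof User)) return false;
        User other = (User) o;
        return age == other.age && name.equals(other.name);
    }

    @Override
    public int hashCode() {
        return 31 * name.hashCode() + age;
    }
}
~~~~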

### Full Example

To show how to use *NoSQLUnit* with *Infinispan*, we are going to create a
very simple application.

[UserManager](#program.user_infinispan_manager) is the business class
responsible for getting and adding a user to the system.

~~~~ {.java}
public class UserManager {

private BasicCache<String, User> cache;

public UserManager(BasicCache<String, User> cache) {
this.cache = cache;
}

public void addUser(User user) {
this.cache.put(user.getName(), user);
}

public User getUser(String name) {
return this.cache.get(name);
}

}
~~~~

And now one unit test is written:

For [unit](#program.user_infinispan_unit) test we are going to use embedded
approach:

~~~~ {.java}
public class WhenUserIsFoundByName {

@ClassRule
public static final EmbeddedInfinispan EMBEDDED_INFINISPAN = newEmbeddedInfinispanRule().build();

@Rule
public final InfinispanRule infinispanRule = newInfinispanRule().defaultEmbeddedInfinispan();

@Inject
private BasicCache<String, User> embeddedCache;

@Test
@UsingDataSet(locations="user.json", loadStrategy=LoadStrategyEnum.CLEAN_INSERT)
public void user_should_be_returned() {

UserManager userManager = new UserManager(embeddedCache);
User user = userManager.getUser("alex");

assertThat(user, is(new User("alex", 32)));
}

}
~~~~

And the dataset used is:

~~~~ {.json}
{
"data": [
{
"key":"alex",
"implementation":"com.lordofthejars.nosqlunit.demo.infinispan.User",
"value": {
"name":"alex",
"age":32
}
}
]
}
~~~~

And one integration test is written:

For the [integration](#program.user_infinispan_integration) test we are going to use the managed
approach:

~~~~ {.java}
public class WhenUserIsInserted {

@ClassRule
public static final ManagedInfinispan MANAGED_INFINISPAN = newManagedInfinispanRule().infinispanPath("/opt/infinispan-5.1.6").build();

@Rule
public final InfinispanRule infinispanRule = newInfinispanRule().defaultManagedInfinispan();

@Inject
private BasicCache<String, User> remoteCache;

@UsingDataSet(loadStrategy=LoadStrategyEnum.DELETE_ALL)
@ShouldMatchDataSet(location="user.json")
@Test
public void user_should_be_available_in_cache() {

UserManager userManager = new UserManager(remoteCache);
userManager.addUser(new User("alex", 32));
}

}
~~~~

Elasticsearch Engine
====================

Elasticsearch
=============

Elasticsearch is a distributed, RESTful, free/open source search server based on Apache Lucene.

**NoSQLUnit** supports **Elasticsearch 1.x** by using the following classes:

----------- -----------------------------------------------------
In Memory com.lordofthejars.nosqlunit.elasticsearch.EmbeddedElasticsearch
Managed com.lordofthejars.nosqlunit.elasticsearch.ManagedElasticsearch
----------- -----------------------------------------------------

: Lifecycle Management Rules

---------------------- -------------------------------------------------
NoSQLUnit Management com.lordofthejars.nosqlunit.elasticsearch.ElasticsearchRule
---------------------- -------------------------------------------------

: Manager Rule

**NoSQLUnit** supports **Elasticsearch 2.x** by using the following classes:

----------- -----------------------------------------------------
In Memory com.lordofthejars.nosqlunit.elasticsearch2.EmbeddedElasticsearch
Managed com.lordofthejars.nosqlunit.elasticsearch2.ManagedElasticsearch
----------- -----------------------------------------------------

: Lifecycle Management Rules

---------------------- -------------------------------------------------
NoSQLUnit Management com.lordofthejars.nosqlunit.elasticsearch2.ElasticsearchRule
---------------------- -------------------------------------------------

: Manager Rule

Maven Setup
-----------

To use **NoSQLUnit** with **Elasticsearch 1.x** you only need to add the following dependency:

~~~~ {.xml}
<dependency>
    <groupId>com.lordofthejars</groupId>
    <artifactId>nosqlunit-elasticsearch</artifactId>
    <version>${version.nosqlunit}</version>
</dependency>
~~~~

To use **NoSQLUnit** with **Elasticsearch 2.x** you only need to add the following dependency:

~~~~ {.xml}
<dependency>
    <groupId>com.lordofthejars</groupId>
    <artifactId>nosqlunit-elasticsearch2</artifactId>
    <version>${version.nosqlunit}</version>
</dependency>
~~~~

Dataset Format
--------------

Default dataset file format in the **Elasticsearch** module is *json*.

Datasets must have the following format:

~~~~ {.json}
{
"documents":[
{
"document":[
{
"index":{
"indexName":"indexName1",
"indexType":"indexType1",
"indexId":"indexId1"
}
},
{
"index":{
"indexName":"indexName2",
"indexType":"indexType2",
"indexId":"indexId2"
}
},
{
"data":{
"property1":"value1",
"property2": value2
}
}
]
},
...
]
}
~~~~

> Notice that if attribute values are integers, double quotes are not
> required. Also, you can define as many *index* subdocuments as required, but only one *data* document, which will be inserted into **Elasticsearch**.
> Moreover, the *indexId* property is only mandatory if you want the inserted data to be validated with *@ShouldMatchDataSet*. The document index is used to run comparisons faster than retrieving all data. If you are not planning to use the expectation capability of **NoSQLUnit**, then you are not required to set the *indexId* property and **Elasticsearch** will provide one for you.

Getting Started
---------------

### Lifecycle Management Strategy

The first step is defining which lifecycle management strategy is required
for your tests. Depending on the kind of test you are implementing (unit
test, integration test, deployment test, ...) you will require an
**embedded**, a **managed** or a **remote** approach.

#### Embedded

To configure the **embedded** approach with **Elasticsearch 1.x**
you should only instantiate the following rule:

~~~~ {.java}
import static com.lordofthejars.nosqlunit.elasticsearch.EmbeddedElasticsearch.EmbeddedElasticsearchRuleBuilder.newEmbeddedElasticsearchRule;

@ClassRule
public static final EmbeddedElasticsearch EMBEDDED_ELASTICSEARCH = newEmbeddedElasticsearchRule().build();
~~~~

In order to configure the **embedded** approach with **Elasticsearch 2.x**
you should instantiate the following rule:

~~~~ {.java}
import static com.lordofthejars.nosqlunit.elasticsearch2.EmbeddedElasticsearch.EmbeddedElasticsearchRuleBuilder.newEmbeddedElasticsearchRule;

@ClassRule
public static final EmbeddedElasticsearch EMBEDDED_ELASTICSEARCH = newEmbeddedElasticsearchRule().build();
~~~~

By default, the **Elasticsearch** node is started as a local node (`node.local: true`), but this property and all other supported properties can be configured, or you can still use the default configuration approach provided by **Elasticsearch** by creating an `elasticsearch.yml` file in the classpath.

Data will be stored in the `target/elasticsearch-test-data/impermanent-db` directory.

#### Managed

To configure the **managed** way with **Elasticsearch 1.x**, we should use the `ManagedElasticsearch` rule, which
may require some configuration parameters.

~~~~ {.java}
import static com.lordofthejars.nosqlunit.elasticsearch.ManagedElasticsearch.ManagedElasticsearchRuleBuilder.newManagedElasticsearchRule;

@ClassRule
public static final ManagedElasticsearch MANAGED_ELASTICSEARCH = newManagedElasticsearchRule().elasticsearchPath("/opt/elasticsearch-1.7.5").build();
~~~~

In order to use a **managed** instance with **Elasticsearch 2.x**, use the following rule:

~~~~ {.java}
import static com.lordofthejars.nosqlunit.elasticsearch2.ManagedElasticsearch.ManagedElasticsearchRuleBuilder.newManagedElasticsearchRule;

@ClassRule
public static final ManagedElasticsearch MANAGED_ELASTICSEARCH = newManagedElasticsearchRule().elasticsearchPath("/opt/elasticsearch-2.0.2").build();
~~~~

By default the managed *Elasticsearch* rule uses the following default values:

- *Elasticsearch* installation directory is retrieved from `ES_HOME`
system environment variable.

- Target path, that is the directory where the **Elasticsearch** server is
started, is `target/elasticsearch-temp`.

ManagedElasticsearch can be created from scratch but, to make life easier,
a *DSL* is provided using the ManagedElasticsearchRuleBuilder class, as seen in the previous example.

#### Remote

Configuring the **remote** approach does not require any special rule
because you (or a system like Maven) are responsible for starting and
stopping the server. This mode is used in deployment tests where you are
testing your application in a real environment.

### Configuring Elasticsearch Connection

The next step is configuring the ***Elasticsearch*** rule, in charge of maintaining
the *Elasticsearch* database in a known state by inserting and deleting defined
datasets. You must register the *ElasticsearchRule* *JUnit* rule class, which
requires a configuration parameter with information like host, port or
database name.

To make developer's life easier and code more readable, a fluent
interface can be used to create these configuration objects. Three
different kinds of configuration builders exist.

#### Embedded

The first one is for configuring a connection to an embedded *Node* instance. For almost all cases the default parameters are enough.

~~~~ {.java}
import static com.lordofthejars.nosqlunit.elasticsearch.ElasticsearchRule.ElasticsearchRuleBuilder.newElasticsearchRule;

@Rule
public ElasticsearchRule elasticsearchRule = newElasticsearchRule().defaultEmbeddedElasticsearch();
~~~~

If you're using **Elasticsearch 2.x**, simply use the classes inside the package `com.lordofthejars.nosqlunit.elasticsearch2`.

If you need to customize the embedded connection, you can use the *EmbeddedElasticsearchConfigurationBuilder* builder class.

#### Managed

The second one is for configuring a connection to a managed/remote *Elasticsearch*
server. Default values are:

---------------- -------------------------------
Host localhost
Port 9300
---------------- -------------------------------

And you can create a managed connection with default values as shown in the next example:

~~~~ {.java}
import static com.lordofthejars.nosqlunit.elasticsearch.ElasticsearchRule.ElasticsearchRuleBuilder.newElasticsearchRule;

@Rule
public ElasticsearchRule elasticsearchRule = newElasticsearchRule().defaultManagedElasticsearch();
~~~~

If you're using **Elasticsearch 2.x**, simply use the classes inside the package `com.lordofthejars.nosqlunit.elasticsearch2`.

But you can customize connection parameters by using the *ManagedElasticsearchConfigurationBuilder* class. There you can set the *Settings* object and define the transport port.

#### Remote

Moreover, we can also use *RemoteElasticsearchConfigurationBuilder*, which allows us to set the host.

~~~~ {.java}
@Rule
public ElasticsearchRule elasticsearchRule = newElasticsearchRule().configure(remoteElasticsearch().host("10.0.10.1").build()).build();
~~~~

### Full Example

Dataset used:

~~~~ {.json}
{
"documents":[
{
"document":[
{
"index":{
"indexName":"books",
"indexType":"book",
"indexId":"1"
}
},
{
"data":{
"title":"The Hobbit",
"numberOfPages":293
}
}
]
}
]
}
~~~~

Managed test:

~~~~ {.java}
@ClassRule
public static final ManagedElasticsearch MANAGED_ELASTICSEARCH = newManagedElasticsearchRule().elasticsearchPath("/opt/elasticsearch-1.7.5").build();

@Rule
public ElasticsearchRule elasticsearchRule = newElasticsearchRule().defaultManagedElasticsearch();

@Inject
private Client client;

@Test
@UsingDataSet(locations="books.json", loadStrategy=LoadStrategyEnum.CLEAN_INSERT)
public void all_books_should_be_returned() {

BookManager bookManager = new BookManager(client);
List<Book> books = bookManager.searchAllBooks();

assertThat(books, hasItems(new Book("The Hobbit", 293)));

}
~~~~
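
The BookManager used above is not listed in this guide; a minimal sketch against the Elasticsearch 1.x client API could look like:

~~~~ {.java}
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

import org.elasticsearch.action.search.SearchResponse;
import org.elasticsearch.client.Client;
import org.elasticsearch.index.query.QueryBuilders;
import org.elasticsearch.search.SearchHit;

public class BookManager {

    private Client client;

    public BookManager(Client client) {
        this.client = client;
    }

    public List<Book> searchAllBooks() {
        // match-all query over the books index
        SearchResponse response = client.prepareSearch("books")
                .setTypes("book")
                .setQuery(QueryBuilders.matchAllQuery())
                .execute().actionGet();

        List<Book> books = new ArrayList<Book>();
        for (SearchHit hit : response.getHits()) {
            Map<String, Object> source = hit.getSource();
            books.add(new Book((String) source.get("title"),
                    ((Number) source.get("numberOfPages")).intValue()));
        }
        return books;
    }
}
~~~~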

Advanced Usage
==============

Customizing Insertion and Comparison Strategy
================================================

**NoSQLUnit** provides a default dataset format; for example, in the case of *Neo4j*
we provide the *GraphML* format, and in the case of *Cassandra* we offer the *cassandra-unit* format.
But because you may have already written datasets in another format, or because you feel more comfortable with another format,
**NoSQLUnit** provides a way to extend the behaviour of the insertion and comparison actions.

To create an extension, each engine offers two interfaces (one for insertion and one for comparison). They are named with the form
`{engine}ComparisonStrategy` and `{engine}InsertionStrategy`; for example, CassandraComparisonStrategy or Neo4jInsertionStrategy.

They provide a method to insert/compare data given an input stream of the defined dataset file, and a callback interface with all connection objects.

Apart from that, each engine has default implementations, `Default{engine}ComparisonStrategy` and `Default{engine}InsertionStrategy`, that can be used as a guide for developing your own extensions.

To register each strategy we must use the *@CustomInsertionStrategy* and *@CustomComparisonStrategy* annotations.

Let's see a very simple example where we define an alternative insertion strategy for the *Redis* system by using a properties file instead of json.

~~~~ {.java}
public class PropertiesCustomInsertion implements RedisInsertionStrategy {

@Override
public void insert(RedisConnectionCallback connection, InputStream dataset) throws Throwable {
Properties properties = new Properties();
properties.load(dataset);

BinaryJedisCommands insertionJedis = connection.insertionJedis();

Set<Entry<Object, Object>> entrySet = properties.entrySet();

for (Entry<Object, Object> entry : entrySet) {
String key = (String) entry.getKey();
String value = (String) entry.getValue();
insertionJedis.set(key.getBytes(), value.getBytes());
}

}

}
~~~~

And test:

~~~~ {.java}
@CustomInsertionStrategy(insertionStrategy = PropertiesCustomInsertion.class)
public class WhenPropertiesCustomInsertionStrategyIsRegistered {

@ClassRule
public static EmbeddedRedis embeddedRedis = newEmbeddedRedisRule().build();

@Rule
public RedisRule redisRule = newRedisRule().defaultEmbeddedRedis();

@Inject
public Jedis jedis;

@Test
@UsingDataSet(locations="data.properties", loadStrategy=LoadStrategyEnum.CLEAN_INSERT)
public void data_should_be_inserted_from_properties_file() {
String name = jedis.get("name");
String surname = jedis.get("surname");

assertThat(name, is("alex"));
assertThat(surname, is("soto"));
}

}
~~~~
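
The `data.properties` dataset consumed by this test simply contains the expected keys and values:

~~~~ {.properties}
name=alex
surname=soto
~~~~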

> **Warning**
>
> Custom annotations are only valid at type scope. The custom strategy will be applied to the whole test.
>
> When using custom strategies for inserting and comparing data, the *location* attribute of *@UsingDataSet* and *@ShouldMatchDataSet* must be specified.

Embedded In-Memory Redis
========================

When you are writing unit tests you should keep in mind that they must
run fast, which implies, among other things, no interaction with the IO
subsystem (disk, network, ...). To avoid this interaction in database
unit tests, there are embedded in-memory databases like *H2*, *HSQLDB*
or *Derby*; in the case of *NoSQL*, engines like *Neo4j* or *Cassandra*
have their own implementations. But *Redis* does not provide any way to
create an embedded in-memory instance in Java. For this reason I have
written an embedded in-memory *Redis* implementation based on the *Jedis*
project.

If you are using **NoSQLUnit** you only have to register the embedded
*Redis* rule as described [here](#program.redis_embedded_conf), and
internally **NoSQLUnit** will create the instance for you, and you will be
able to [inject](#advanced.jsr330-title) the instance into your code.

But it can also be used outside the umbrella of **NoSQLUnit**, by
instantiating it manually, as described in the next
[example:](#advanced.instantiate-embedded-redis)

~~~~ {.java}
EmbeddedRedisBuilder embeddedRedisBuilder = new EmbeddedRedisBuilder();
Jedis jedis = embeddedRedisBuilder.createEmbeddedJedis();

~~~~

Notice that the Jedis class is the main class defined by the Jedis project, but
proxied to use in-memory data instead of sending requests to a remote
server.

Almost all *Redis* operations have been implemented, but with some
limitations:

- Connection commands do nothing; they do not throw any exception but
neither perform any action. In fact, it would not make sense for them to do
anything.

- Scripting commands are not supported, and an
UnsupportedOperationException will be thrown if they are called.

- Transaction commands are not supported, but they do not throw any
exception; they simply return a null value, and in the case of a List return
type, an empty list is returned.

- Pub/Sub commands do nothing.

- Server commands are implemented, but some commands make no sense
in-memory and return a constant result:

*move* always returns 1.

*debug* commands throw an UnsupportedOperationException.

*bgrewriteaof, save, bgsave, configSet, configResetStat, slaveOf,
slaveOfNone and slowLogReset* return OK.

*configGet, slowLogGet and slowLogGetBinary* return an empty list.

- Of the key commands, only sort by pattern is not supported.

All the other operations, including flushing, expiration control and
every operation of each datatype, are supported in the same way Jedis
supports them. Note that expiration management is also implemented as
described in the Redis manual.
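
For instance, the proxied instance behaves like a regular Jedis for the supported commands (a minimal sketch):

~~~~ {.java}
Jedis jedis = new EmbeddedRedisBuilder().createEmbeddedJedis();

// plain key/value and hash operations work against in-memory structures
jedis.set("name", "alex");
jedis.hset("user", "name", "alex");

String name = jedis.get("name"); // "alex"
~~~~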

> **Warning**
>
> This implementation of Redis is provided for testing purposes, not as a
> substitute for Redis. Feel free to report any issue with this
> implementation so it can be fixed or implemented.

DynamoDB Engine
==============

DynamoDB
=======

DynamoDB is a fully managed *NoSQL* key-value and document database
provided by Amazon Web Services.

**NoSQLUnit** supports *DynamoDB* by using the following classes:

----------- -----------------------------------------------------
In Memory com.lordofthejars.nosqlunit.dynamodb.InMemoryDynamoDb
----------- -----------------------------------------------------

: Lifecycle Management Rules

---------------------- -------------------------------------------------
NoSQLUnit Management com.lordofthejars.nosqlunit.dynamodb.DynamoDbRule
---------------------- -------------------------------------------------

: Manager Rule

Maven Setup
-----------

To use **NoSQLUnit** with DynamoDB you only need to add the following dependency:

~~~~ {.xml}
<dependency>
    <groupId>com.lordofthejars</groupId>
    <artifactId>nosqlunit-dynamodb</artifactId>
    <version>${version.nosqlunit}</version>
</dependency>
~~~~

Note that if you are planning to use the **in-memory** approach, it is implemented using
[*DynamoDBLocal*](https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/DynamoDBLocal.html), a project from Amazon which runs a local instance of DynamoDB.

Dataset Format
--------------

Default dataset file format in the *DynamoDB* module is *json*.

Datasets must have the following [format](#ex.dynamodb_dataset):

~~~~ {.json}
{
"table1": [
{
"attribute_1": { "type": "value1" },
"attribute_2": { "type": "value2" }
},
{
"attribute_3": { "N :"2" },
"attribute_4": { "S": "value4" }
}
],
"name_table2": [
...
],
....
}
~~~~

Notice that the type descriptor defines how a value is interpreted, but the value itself is always written in double quotes.

> If you want to know more details about data types, please refer to how the *DynamoDB* protocol deals with them: [DynamoDB Data Type Descriptors](https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/Programming.LowLevelAPI.html#Programming.LowLevelAPI.DataTypeDescriptors)
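
These descriptors mirror the AWS SDK's low-level Java API; a minimal sketch of the equivalent item attributes:

~~~~ {.java}
// uses com.amazonaws.services.dynamodbv2.model.AttributeValue;
// "N" (number, still passed as a string) and "S" (string) match the dataset above
Map<String, AttributeValue> item = new HashMap<String, AttributeValue>();
item.put("attribute_3", new AttributeValue().withN("2"));
item.put("attribute_4", new AttributeValue().withS("value4"));
~~~~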

Getting Started
---------------

### Lifecycle Management Strategy

The first step is defining which lifecycle management strategy is required
for your tests. Right now only the **in-memory** approach is supported.

To configure the **in-memory** approach you should only instantiate the following
[rule](#program.inmemory_conf):

~~~~ {.java}
import static com.lordofthejars.nosqlunit.dynamodb.InMemoryDynamoDb.InMemoryDynamoRuleBuilder.newInMemoryDynamoDbRule;

@ClassRule
public static InMemoryDynamoDb inMemoryDynamoDb = newInMemoryDynamoDbRule().build();
~~~~

### Configuring DynamoDB Connection

The next step is configuring the ***DynamoDB*** rule, in charge of maintaining
the *DynamoDB* database in a known state by inserting and deleting defined
datasets. You must register the DynamoDbRule *JUnit* rule class.

~~~~ {.java}
import static com.lordofthejars.nosqlunit.dynamodb.DynamoDbRule.DynamoDbRuleBuilder.newDynamoDbRule;

@Rule
public DynamoDbRule embeddedDynamoDbRule = newDynamoDbRule().defaultEmbeddedDynamoDb();
~~~~

But we can also define it to use the instance defined by Spring Data DynamoDB.

If you are planning to use **Spring Data DynamoDB**, you may need to use the *DynamoDB* instance defined within the Spring application context, mostly because you are defining an embedded connection using **DynamoDBLocal**, for example with a bean definition along these lines (illustrative only):

~~~~ {.xml}
<!-- illustrative only: an AmazonDynamoDB client bean pointing at the local instance -->
<bean id="amazonDynamoDB" class="com.amazonaws.services.dynamodbv2.AmazonDynamoDBClient" />
~~~~

In these cases you should use a special method which gets the *DynamoDB* instance instead of creating a new one.

~~~~ {.java}
@Autowired
private ApplicationContext applicationContext;

@Rule
public DynamoDbRule dynamoDbRule = newDynamoDbRule().defaultSpringDynamoDb();
~~~~

> Note that you need to autowire the application context so **NoSQLUnit** can inject the instance defined within the application context into the *DynamoDbRule*.

### Complete Example

Consider a library application which, apart from multiple operations,
allows us to add new books to the system. Our [model](#example.book_model) is
as simple as:

~~~~ {.java}
@DynamoDBTable(tableName = "books")
public class Book {

@DynamoDBHashKey
@DynamoDBAutoGeneratedKey
private String id;

private String title;

private int numberOfPages;

public Book() {
this.title = "untitled";
this.numberOfPages = 0;
}

public Book(String title, int numberOfPages) {
this.title = title;
this.numberOfPages = numberOfPages;
}

public void setTitle(String title) {
this.title = title;
}

public void setNumberOfPages(int numberOfPages) {
this.numberOfPages = numberOfPages;
}

public String getTitle() {
return title;
}

public int getNumberOfPages() {
return numberOfPages;
}
}
~~~~

The next business [class](#example.book_manager) is responsible for
managing access to the *DynamoDB* server:

~~~~ {.java}
public class BookManager {

private static final Logger LOGGER = LoggerFactory.getLogger(BookManager.class);

private AmazonDynamoDB client;

private DynamoDB dynamoDB;

private Table table;

public BookManager(AmazonDynamoDB client) {
this.client = client;
dynamoDB = new DynamoDB(client);
table = dynamoDB.getTable("books");
}

public void create(Book book) {
Item item = new Item()
.withPrimaryKey("id", "01b3c3d5-75ce-49d4-91a6-8955727f8154") // hard-coded for now
.withString("title", book.getTitle())
.withNumber("numberOfPages", book.getNumberOfPages());
table.putItem(item);
}
}
~~~~

And now it is time for testing. In the next
[test](#example.test_insert_book) we are going to validate that a book
is inserted correctly into the database.

~~~~ {.java}
package com.lordofthejars.nosqlunit.demo.dynamodb;

public class WhenANewBookIsCreated {

@ClassRule
public static InMemoryDynamoDb inMemoryDynamoDb = newInMemoryDynamoDbRule().build();

@Rule
public DynamoDbRule dynamoDbRule = newDynamoDbRule().defaultEmbeddedDynamoDb();

@Test
@UsingDataSet(locations="initialData.json", loadStrategy=LoadStrategyEnum.CLEAN_INSERT)
@ShouldMatchDataSet(location="expectedData.json")
public void book_should_be_inserted_into_repository() {
AmazonDynamoDB amazonDynamoDB = new AmazonDynamoDBClient();
amazonDynamoDB.setEndpoint("http://localhost:8000");

BookManager bookManager = new BookManager(amazonDynamoDB);

Book book = new Book("The Lord Of The Rings", 1299);
bookManager.create(book);
}

}
~~~~

In the [previous](#example.test_insert_book) test we have defined that
*DynamoDB* will be an in-memory instance. Moreover, we are setting an
[initial](#example.dataset_book) dataset in the file `initialData.json`,
located in the classpath at
`com/lordofthejars/nosqlunit/demo/dynamodb/initialData.json`,
and an [expected](#example.expected_dataset_book)
dataset called `expectedData.json`.

~~~~ {.json}
{
"Books":
[
{"id": {"S": "d04b4476-c9c0-4f38-8c44-4bd6b09e261b"}, title": {"S": The Hobbit"}, "numberOfPages": {"N": 293} }
]
}
~~~~

~~~~ {.json}
{
"Book":
[
{"id": {"S": "d04b4476-c9c0-4f38-8c44-4bd6b09e261b"}, "title": {"S": The Hobbit"}, "numberOfPages": {"N": "293"} },
{"id": {"S": "01b3c3d5-75ce-49d4-91a6-8955727f8154"}, "title": {"S": The Lord Of The Rings"} ,"numberOfPages": {"N": 1299"} }
]
}
~~~~

You can see unit tests at
[github](https://github.com/lordofthejars/nosql-unit/tree/master/nosqlunit-dynamodb/src/test/java/com/lordofthejars/nosqlunit/dynamodb/integration/WhenDynamoDbRuleIsRegistered.java)
.

InfluxDB Engine
==============

InfluxDB
=======

InfluxDB is a time series database designed to handle high write and query loads.

**NoSQLUnit** supports *InfluxDB* by using the following classes:

----------- -----------------------------------------------------
In Memory com.lordofthejars.nosqlunit.influxdb.InMemoryInfluxDb
----------- -----------------------------------------------------

: Lifecycle Management Rules

---------------------- -------------------------------------------------
NoSQLUnit Management com.lordofthejars.nosqlunit.influxdb.InfluxDbRule
---------------------- -------------------------------------------------

: Manager Rule

Maven Setup
-----------

To use **NoSQLUnit** with InfluxDB you only need to add the following dependency:

~~~~ {.xml}
<dependency>
    <groupId>com.lordofthejars</groupId>
    <artifactId>nosqlunit-influxdb</artifactId>
    <version>${version.nosqlunit}</version>
</dependency>
~~~~

Note that if you are planning to use the **in-memory** approach, it is implemented using
[*embed-influxDB*](https://github.com/APISENSE/embed-influxDB), a project which runs a local instance of InfluxDB.

Dataset Format
--------------

Default dataset file format in the *InfluxDB* module is *json*.

Datasets must have the following [format](#ex.influxdb_dataset):

~~~~ {.json}
{
"measurement1": [
{
"time": 1514764800000000000,
"precision": "NANOSECONDS",
"tags": { "tag_name": "tag_value" },
"fields": { "field_name": "field_value" }
},
{
"time": 1514764800000000000,
"precision": "NANOSECONDS",
"tags": { "tag_name": "tag_value" },
"fields": { "field_name": "field_value" }
}
],
"measurement2": [
...
],
....
}
~~~~
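
Each entry maps directly onto a point of the influxdb-java client; a minimal sketch of the equivalent write (the `influxDB` client instance and the `test-db` database are assumptions for illustration):

~~~~ {.java}
// uses org.influxdb.dto.Point and java.util.concurrent.TimeUnit;
// equivalent write for the first entry of measurement1 above
Point point = Point.measurement("measurement1")
        .time(1514764800000000000L, TimeUnit.NANOSECONDS)
        .tag("tag_name", "tag_value")
        .addField("field_name", "field_value")
        .build();

influxDB.write("test-db", "autogen", point);
~~~~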

Getting Started
---------------

### Lifecycle Management Strategy

The first step is defining which lifecycle management strategy is
required for your tests. Right now only the **in-memory** approach is
supported.

To configure the **in-memory** approach you only need to instantiate the
following [rule](#program.inmemory_conf):

~~~~ {.java}
import static com.lordofthejars.nosqlunit.influxdb.InMemoryInfluxDb.InMemoryInfluxRuleBuilder.newInMemoryInfluxDbRule;

@ClassRule
public static InMemoryInfluxDb inMemoryInfluxDb = newInMemoryInfluxDbRule().build();
~~~~

### Configuring InfluxDB Connection

The next step is configuring the ***InfluxDB*** rule in charge of
maintaining the *InfluxDB* database in a known state by inserting and
deleting defined datasets. You must register the InfluxDbRule *JUnit*
rule class.

~~~~ {.java}
import static com.lordofthejars.nosqlunit.influxdb.InfluxDbRule.InfluxDbRuleBuilder.newInfluxDbRule;

@Rule
public InfluxDbRule embeddedInfluxDbRule = newInfluxDbRule().defaultEmbeddedInfluxDb("test-db");
~~~~
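Putting both rules together, a complete test looks like the following
sketch (the dataset file names and the test body are assumptions for
illustration):

~~~~ {.java}
import static com.lordofthejars.nosqlunit.influxdb.InMemoryInfluxDb.InMemoryInfluxRuleBuilder.newInMemoryInfluxDbRule;
import static com.lordofthejars.nosqlunit.influxdb.InfluxDbRule.InfluxDbRuleBuilder.newInfluxDbRule;

import org.junit.ClassRule;
import org.junit.Rule;
import org.junit.Test;

import com.lordofthejars.nosqlunit.annotation.ShouldMatchDataSet;
import com.lordofthejars.nosqlunit.annotation.UsingDataSet;
import com.lordofthejars.nosqlunit.core.LoadStrategyEnum;
import com.lordofthejars.nosqlunit.influxdb.InMemoryInfluxDb;
import com.lordofthejars.nosqlunit.influxdb.InfluxDbRule;

public class WhenAMeasurementIsWritten {

    @ClassRule
    public static InMemoryInfluxDb inMemoryInfluxDb = newInMemoryInfluxDbRule().build();

    @Rule
    public InfluxDbRule influxDbRule = newInfluxDbRule().defaultEmbeddedInfluxDb("test-db");

    @Test
    @UsingDataSet(locations = "initialMeasurements.json", loadStrategy = LoadStrategyEnum.CLEAN_INSERT)
    @ShouldMatchDataSet(location = "expectedMeasurements.json")
    public void measurement_should_be_written_into_database() {
        // Exercise the code under test here, e.g. a repository
        // writing one new point into the "test-db" database.
    }
}
~~~~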

We can also configure the rule to use an *InfluxDB* instance defined by Spring.

If you are planning to use **Spring Data InfluxDB**, you may need to use the *InfluxDB* instance defined within the Spring Application Context, mostly because you are defining an embedded connection using **embed-influxDB**. In these cases you should use a special method which retrieves that *InfluxDB* instance instead of creating a new one.

~~~~ {.java}
@Autowired
private ApplicationContext applicationContext;

@Rule
public InfluxDbRule influxDbRule = newInfluxDbRule().defaultSpringInfluxDb();
~~~~

> Note that you need to autowire the application context, so **NoSQLUnit** can inject the instance defined within the application context into the *InfluxDbRule*.

You can see the unit tests at
[github](https://github.com/lordofthejars/nosql-unit/tree/master/nosqlunit-influxdb/src/test/java/com/lordofthejars/nosqlunit/influxdb/integration/WhenInfluxDbRuleIsRegistered.java).

# MarkLogic Engine

# MarkLogic

MarkLogic is a commercial *NoSQL* database with support for different
document formats, like XML, JSON and unstructured content.

**NoSQLUnit** targets [MarkLogic](https://www.marklogic.com/) with the
following classes:

| | |
| ------- | -------------------------------------------------------- |
| Managed | `com.lordofthejars.nosqlunit.marklogic.ManagedMarkLogic` |

Lifecycle Management Rules

| | |
| -------------------- | ----------------------------------------------------- |
| NoSQLUnit Management | `com.lordofthejars.nosqlunit.marklogic.MarkLogicRule` |

Manager Rule

## Maven Setup

To use **NoSQLUnit** with MarkLogic you need to add the following
dependencies and repository. Please consult the [MarkLogic Java
API](https://developer.marklogic.com/products/java) for further details:

``` xml
<dependency>
    <groupId>com.lordofthejars</groupId>
    <artifactId>nosqlunit-marklogic</artifactId>
    <version>${version.nosqlunit}</version>
</dependency>
<dependency>
    <groupId>com.marklogic</groupId>
    <artifactId>marklogic-client-api</artifactId>
    <version>${version.marklogic-client-api}</version>
</dependency>

<repository>
    <id>jcenter</id>
    <url>http://jcenter.bintray.com</url>
</repository>
```

## Data Set Formats

The default data set file format in the *MarkLogic* module is XML; JSON
and binary formats are also supported.

XML data sets must have the following [format for a single
document](#ex.marklogic_xml_data%20set_single), and the [next one for
seeding multiple documents at once](#ex.marklogic_xml_data%20set_multi):

``` xml
<book>
    <title>The Hobbit</title>
    <numberOfPages>293</numberOfPages>
</book>
```

``` xml
<books>
    <book>
        <title>The Hobbit</title>
        <numberOfPages>293</numberOfPages>
    </book>
    <book>
        <title>The Silmarillion the Myths and Legends of Middle Earth</title>
        <numberOfPages>365</numberOfPages>
    </book>
    <book>
        <title>The Lord Of The Rings</title>
        <numberOfPages>1299</numberOfPages>
    </book>
</books>
```

JSON data sets must have the following [format](#ex.marklogic_json_data%20set):

``` json
{
    "/books/The Hobbit.json": {
        "collections": [
            "books",
            "bestsellers"
        ],
        "data": {
            "title": "The Hobbit",
            "numberOfPages": 293
        }
    },
    "/books/The Silmarillion the Myths and Legends of Middle Earth.json": {
        "data": {
            "title": "The Silmarillion the Myths and Legends of Middle Earth",
            "numberOfPages": 365
        }
    }
    ....
}
```

Note that if an attribute value is an integer, double quotes are not
required.

where:

- *uri*: the ID (or URI) of the document in the MarkLogic database.

- *data*: the actual content of the document in the MarkLogic
  database.

- *collections*: the list of MarkLogic collections the document is
  added to; comma separated, optional.

Binary and text data sets are handled with a different
[format](#ex.marklogic_binary_data%20set):

``` shell
/
|
books/The Hobbit.txt
books/The Silmarillion the Myths and Legends of Middle Earth.docx
books/The Lord Of The Rings.pdf
authors/J. R. R. Tolkien.jpg
```

Note that the path of the document, relative to that of the base test
class, determines the document ID (URI) in the MarkLogic database.
Collection assignment is not supported. For more advanced use cases for
ingesting binary content into a MarkLogic database (not supported by
NoSQLUnit), refer to the [Content Processing
Framework](https://docs.marklogic.com/guide/cpf.pdf).

## Getting Started

### Lifecycle Management Strategy

The first step is defining which lifecycle management strategy is
required for your tests. Depending on the kind of test you are
implementing (unit test, integration test, deployment test, ...) you
will require a managed or remote approach. *There is no support for an
embedded approach since no embedded version is provided by the vendor.*
Additionally, NoSQLUnit provides an adapter for Docker in managed mode.

#### Managed Lifecycle

To configure the managed way, two possible approaches can be used.

The first one is using a **docker container**. This gives you great
flexibility when working with different database versions or while
evaluating the product. For details see this [blog
entry](https://www.marklogic.com/blog/building-a-marklogic-docker-container/):

``` java
import static com.lordofthejars.nosqlunit.marklogic.ManagedMarkLogic.MarkLogicServerRuleBuilder.newManagedMarkLogicRule;

@ClassRule
public static final ManagedMarkLogic managedMarkLogic = newManagedMarkLogicRule().dockerCommand("/sbin/docker").dockerContainer("marklogic").build();
```

Note that you can define either a container name or a container ID
directly, as supported by Docker.

By default managed *MarkLogic* in Docker uses default values, but it can
be configured programmatically as shown in the previous
[example](#program.marklogic_docker_conf):

| | |
| -------------- | ------------------------------------------ |
| Docker Command | The executable Docker binary is `docker` . |

Default Docker Values

The second strategy is **starting and stopping an already installed
server** on the executing machine, by triggering start and stop on the
MarkLogic service. The class-level
[rule](#program.marklogic_managed_conf) should be registered:

``` java
import static com.lordofthejars.nosqlunit.marklogic.ManagedMarkLogic.MarkLogicServerRuleBuilder.newManagedMarkLogicRule;

@ClassRule
public static final ManagedMarkLogic managedMarkLogic = newManagedMarkLogicRule().build();
```

By default the managed *MarkLogic* rule uses a set of default values,
but it can be configured programmatically as shown in the previous
[example](#program.marklogic_managed_conf):

|                          |                                                                                                                              |
| ------------------------ | ---------------------------------------------------------------------------------------------------------------------------- |
| Target path              | The directory where the starting process is executed. Usually you don't have to modify it. Default: `target/marklogic-temp`   |
| Admin Port               | The port the server listens on for administration commands and 'heartbeats'. Default: 8001                                    |
| MarkLogic Service prefix | Determines the service command. Windows: `%ProgramFiles%\MarkLogic\`, OSX: `~/Library/StartupItems/`, Unix: `/sbin/service`   |
| User name                | A MarkLogic administrator having permissions to access the administrative interfaces.                                          |
| Password                 | The MarkLogic administrator's password.                                                                                        |

Default Managed Values

#### Remote Lifecycle

Configuring the **remote** approach does not require any special rule
because you (or a system like Maven) are responsible for starting and
stopping the server. This mode is used in deployment tests where you are
testing your application in a real environment.

### Configuring MarkLogic Connection

The next step is configuring the **MarkLogic** rule in charge of
maintaining documents in a known state by inserting and deleting defined
data sets. You must register the `MarkLogicRule` *JUnit* rule class,
which requires a configuration parameter with information like host,
port, application user and password, etc.

To make developers' lives easier and the code more readable, a fluent
interface can be used to create these configuration objects. There are
two different kinds of configuration builders:
[Managed](#program.managed_marklogic_connection_parameters) and
[Remote](#program.remote_marklogic_connection_parameters).

#### Managed/Remote Connections

The configuration of a connection to a local or remote *MarkLogic*
server is pretty much the same. Default values are:

| | |
| ---------------------------------------------------------------------------------------------- | ----------------------------- |
| Host | localhost |
| Port | 8000 |
| Credentials | No authentication parameters. |
| Secure (whether to use TLS) | false |
| Use Gateway (whether a MarkLogic Cluster Gateway is in use) | false |
| Database | Documents |
| Clean Directory (during the *Delete* test phase controls which directory should be erased) | None |
| Clean Collections (during the *Delete* test phase controls which collections should be erased) | None |

Default Connection Values

``` java
import static com.lordofthejars.nosqlunit.marklogic.ManagedMarkLogicConfigurationBuilder.marklogic;

@Rule
public MarkLogicRule markLogicRule = new MarkLogicRule(marklogic().build());
```

``` java
import static com.lordofthejars.nosqlunit.marklogic.RemoteMarkLogicConfigurationBuilder.remoteMarkLogic;

@Rule
public MarkLogicRule markLogicRule = new MarkLogicRule(remoteMarkLogic().host("localhost").port(9001).secure().useGateway().database("some-db").build());
```

Note that the only difference between the local and remote connections
is that the remote host has no predefined value.

### Verifying Content

`@ShouldMatchDataSet` is also supported for *MarkLogic* content, but we
should keep some considerations in mind.

To compare two XML documents, the stored content is exported as DOM and
then compared with the expected *XML* using the
[XmlUnit](https://www.xmlunit.org/) framework, whereas the comparison of
*JSON* documents uses the
[JsonUnit](https://github.com/lukas-krecan/JsonUnit) framework. The
[control attributes](#attr.marklogic_data%20set_controls) in the
expected file are ignored since they don't appear in the database
document. Furthermore, the ignore option is supported either through
[XPath expressions or a bean-property style with
JSON](#ex.marklogic_flexible_comparison). Note that neither ignore
style is possible with unstructured documents (like binary or text);
unstructured documents are compared byte-by-byte.

``` java
@CustomComparisonStrategy(comparisonStrategy = MarkLogicFlexibleComparisonStrategy.class)
public class MarkLogicFlexibleComparisonStrategyTest {

    ...............

    @Test
    @UsingDataSet(locations = "jane-john.xml")
    @ShouldMatchDataSet(location = "jane-john-ignored.xml")
    @IgnorePropertyValue(properties = {"//phoneNumber/type", "/person/age", "//address/@type"})
    public void shouldIgnoreXmlPropertiesInFlexibleStrategy() {
    }

    @Test
    @UsingDataSet(locations = "jane-john.json")
    @ShouldMatchDataSet(location = "jane-john-ignored.json")
    @IgnorePropertyValue(properties = {"phoneNumbers[*].type", "age"})
    public void shouldIgnoreJsonPropertiesInFlexibleStrategy() {
    }
}
```

### Full Example

To show how to use **NoSQLUnit** with *MarkLogic*, we are going to
create a very simple application which searches for books stored in the
database.

[Generic-, Xml- and JsonBookManager](#program.books_marklogic_manager)
are the business classes responsible for inserting new books and for
querying them, either by book title or by listing all books.

``` java
public abstract class GenericBookManager {

    protected static final String BOOKS_DIRECTORY = "/books/";

    protected DatabaseClient client;

    public GenericBookManager(DatabaseClient client) {
        this.client = client;
    }

    public void create(Book book) {
        DocumentManager documentManager = documentManager();
        DocumentWriteSet writeSet = documentManager.newWriteSet();
        ContentHandle<Book> contentHandle = contentHandleFactory().newHandle(Book.class);
        contentHandle.set(book);
        writeSet.add(BOOKS_DIRECTORY + book.getTitle() + extension(), contentHandle);
        documentManager.write(writeSet);
    }

    public Book findBookById(String id) {
        List<Book> result = search(new StructuredQueryBuilder().document(BOOKS_DIRECTORY + id + extension()), 1);
        return result.isEmpty() ? null : result.get(0);
    }

    public List<Book> findAllBooksInCollection(String... collections) {
        return search(new StructuredQueryBuilder().collection(collections), 1);
    }

    public List<Book> findAllBooks() {
        return search(new StructuredQueryBuilder().directory(true, BOOKS_DIRECTORY), 1);
    }

    public List<Book> search(QueryDefinition query, long start) {
        List<Book> result = new ArrayList<>();
        DocumentPage documentPage = documentManager().search(query, start);
        while (documentPage.hasNext()) {
            ContentHandle<Book> handle = contentHandleFactory().newHandle(Book.class);
            handle = documentPage.nextContent(handle);
            result.add(handle.get());
        }
        return result;
    }

    protected abstract DocumentManager documentManager();

    protected abstract Format format();

    protected abstract ContentHandleFactory contentHandleFactory();

    protected String extension() {
        return "." + format().name().toLowerCase();
    }
}

....................

public class XmlBookManager extends GenericBookManager {

    private final ContentHandleFactory contentHandleFactory;

    public XmlBookManager(DatabaseClient client) {
        super(client);
        try {
            contentHandleFactory = JAXBHandle.newFactory(Book.class);
        } catch (JAXBException e) {
            throw new IllegalArgumentException("Couldn't instantiate the JAXB factory", e);
        }
    }

    @Override
    protected DocumentManager documentManager() {
        return client.newXMLDocumentManager();
    }

    @Override
    protected Format format() {
        return XML;
    }

    @Override
    protected ContentHandleFactory contentHandleFactory() {
        return contentHandleFactory;
    }
}

................

public class JsonBookManager extends GenericBookManager {

    private final ContentHandleFactory contentHandleFactory;

    public JsonBookManager(DatabaseClient client) {
        super(client);
        contentHandleFactory = JacksonDatabindHandle.newFactory(Book.class);
    }

    @Override
    protected DocumentManager documentManager() {
        return client.newJSONDocumentManager();
    }

    @Override
    protected Format format() {
        return JSON;
    }

    @Override
    protected ContentHandleFactory contentHandleFactory() {
        return contentHandleFactory;
    }
}
```

And now we get started with [integration
tests](#program.books_marklogic_integration).

``` java
...................

import static com.lordofthejars.nosqlunit.marklogic.ManagedMarkLogic.MarkLogicServerRuleBuilder.newManagedMarkLogicRule;
import static com.lordofthejars.nosqlunit.marklogic.MarkLogicRule.MarkLogicRuleBuilder.newMarkLogicRule;

public class WhenANewBookIsCreated {

    @ClassRule
    public static final ManagedMarkLogic managedMarkLogic = newManagedMarkLogicRule().build();

    @Rule
    public MarkLogicRule managedMarkLogicRule = newMarkLogicRule().defaultManagedMarkLogic();

    @Inject
    private DatabaseClient client;

    @Test
    @UsingDataSet(locations = "books.xml", loadStrategy = CLEAN_INSERT)
    @ShouldMatchDataSet(location = "books-expected.xml")
    public void xml_book_should_be_inserted_into_database() {
        GenericBookManager bookManager = new XmlBookManager(client);
        Book book = new Book("The Road Goes Ever On", 96);
        bookManager.create(book);
    }

    @Test
    @UsingDataSet(locations = "books.json", loadStrategy = CLEAN_INSERT)
    @ShouldMatchDataSet(location = "books-expected.json")
    public void json_book_should_be_inserted_into_database() {
        GenericBookManager bookManager = new JsonBookManager(client);
        Book book = new Book("The Road Goes Ever On", 96);
        bookManager.create(book);
    }

    @Test
    @UsingDataSet(locations = {"books/Lorem Ipsum.docx", "books/Lorem Ipsum.txt"}, loadStrategy = CLEAN_INSERT)
    @ShouldMatchDataSet(location = "books/Lorem Ipsum-expected.pdf")
    public void pdf_book_should_be_inserted_into_database() throws IOException, URISyntaxException {
        GenericBookManager bookManager = new BinaryBookManager(client);
        byte[] content = Files.readAllBytes(
                Paths.get(getClass().getResource("books/Lorem Ipsum.pdf").toURI())
        );
        BinaryBook book = new BinaryBook("/books/Lorem Ipsum.pdf", content);
        bookManager.create(book);
    }
}
```

Note that in both cases we are using similar data sets as the initial
state, which look like this:

``` xml
<books>
    <book>
        <title>The Hobbit</title>
        <numberOfPages>293</numberOfPages>
    </book>
    <book>
        <title>The Silmarillion the Myths and Legends of Middle Earth</title>
        <numberOfPages>365</numberOfPages>
    </book>
    <book>
        <title>The Lord Of The Rings</title>
        <numberOfPages>1299</numberOfPages>
    </book>
</books>
```

And, for JSON:

``` json
{
    "/books/The Hobbit.json": {
        "collections": [
            "bestsellers"
        ],
        "data": {
            "title": "The Hobbit",
            "numberOfPages": 293
        }
    },
    "/books/The Silmarillion the Myths and Legends of Middle Earth.json": {
        "data": {
            "title": "The Silmarillion the Myths and Legends of Middle Earth",
            "numberOfPages": 365
        }
    },
    "/books/The Lord Of The Rings.json": {
        "collections": [
            "bestsellers"
        ],
        "data": {
            "title": "The Lord Of The Rings",
            "numberOfPages": 1299
        }
    }
}
```

## Current Limitations

- Semantic searches are not supported

- Currently there is no way to define a control data set for binary
documents containing *multiple* entries

Managing lifecycle of multiple instances
========================================

Sometimes your tests will require more than one instance of the same
database server (running on different ports) to be started, for example
when testing database sharding. In the [next
example](#advanced.multipleinstances-database) we see how to configure
**NoSQLUnit** to manage the lifecycle of multiple instances:

~~~~ {.java}
@ClassRule
public static ManagedRedis managedRedis79 = newManagedRedisRule().redisPath("/opt/redis-2.4.16")
        .targetPath("target/redis1")
        .configurationPath(getAbsoluteFilePath("src/test/resources/redis_6379.conf"))
        .port(6379)
        .build();

@ClassRule
public static ManagedRedis managedRedis80 = newManagedRedisRule().redisPath("/opt/redis-2.4.16")
        .targetPath("target/redis2")
        .configurationPath(getAbsoluteFilePath("src/test/resources/redis_6380.conf"))
        .port(6380)
        .build();
~~~~

> **Warning**
>
> Note that the target path should be set to a different value for each
> instance; otherwise some started processes may not be shut down.
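Each started instance then needs its own manager rule pointing at the
corresponding port, usually combined with a connection identifier (see
the following chapters). A sketch of the idea with MongoDB rules,
assuming the `mongoDb()` configuration builder exposes a `port(...)`
setter:

~~~~ {.java}
@Rule
public MongoDbRule mongoDbRule1 = new MongoDbRule(mongoDb()
        .databaseName("test").port(27017).connectionIdentifier("one").build(), this);

@Rule
public MongoDbRule mongoDbRule2 = new MongoDbRule(mongoDb()
        .databaseName("test").port(27018).connectionIdentifier("two").build(), this);
~~~~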

Fast Way
========

When you instantiate a rule for maintaining the database in a known
state (MongoDbRule, Neo4jRule, ...), **NoSQLUnit** requires you to set a
configuration object with properties like host, port, database name, and
so on. Although most of the time the default values are enough, we still
need to create the configuration object, which makes our code harder to
read.

We can avoid this by using an inner builder inside each rule, which
creates for us a rule with default parameters set. For example, for
Neo4jRule:

~~~~ {.java}
import static com.lordofthejars.nosqlunit.neo4j.Neo4jRule.Neo4jRuleBuilder.newNeo4jRule;
@Rule
public Neo4jRule neo4jRule = newNeo4jRule().defaultEmbeddedNeo4j();
~~~~

In the previous [example](#advanced.fastway-database), Neo4jRule is
configured for the embedded approach with default parameters.

Here is another example, using CassandraRule in managed mode:

~~~~ {.java}
import static com.lordofthejars.nosqlunit.cassandra.CassandraRule.CassandraRuleBuilder.newCassandraRule;
@Rule
public CassandraRule cassandraRule = newCassandraRule().defaultManagedCassandra("Test Cluster");
~~~~

Each rule contains its own builder class for creating it with default values.

Simultaneous engines
====================

Sometimes applications will contain more than one *NoSQL* engine; for
example, some parts of your model may be expressed better as a graph
(with Neo4j, for example), while other parts may be more natural in a
column-oriented store (for example, Cassandra). **NoSQLUnit** supports
this kind of scenario by providing, in integration tests, a way to not
load all datasets into one system but to choose which datasets are
stored in each backend.

For declaring more than one engine, you must give a name to each
database *Rule* using the `connectionIdentifier()` method of the
configuration instance.

~~~~ {.java}
@Rule
public MongoDbRule remoteMongoDbRule1 = new MongoDbRule(mongoDb()
        .databaseName("test").connectionIdentifier("one").build(), this);
~~~~

You also need to provide an identified dataset for each engine, using
the `withSelectiveLocations` attribute of the `@UsingDataSet`
annotation. You must set up the pair "named connection" / datasets.

~~~~ {.java}
@UsingDataSet(withSelectiveLocations = {
        @Selective(identifier = "one", locations = "test3") },
        loadStrategy = LoadStrategyEnum.REFRESH)
~~~~

In the [example](#advanced.dataset-selective) we are refreshing the
database declared in the [previous example](#advanced.name-database)
with data located in the *test3* file.

This also works in the expectations annotation:

~~~~ {.java}
@ShouldMatchDataSet(withSelectiveMatcher = {
        @SelectiveMatcher(identifier = "one", location = "test3")
})
~~~~

When you use more than one engine at a time, you should take the
following rules into consideration (an illustration follows the list):

- If the location attribute is set, it is used and the
  withSelectiveMatcher attribute data is ignored. Location data is
  populated through all registered systems.
- If location is not set, the system tries to insert the data defined
  in the withSelectiveMatcher attribute into each matching backend.
- If the withSelectiveMatcher attribute is not set either, the default
  strategy (explained in the previous section) is taken. Note that the
  default strategy replicates all datasets to all defined engines.
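For instance, in the following hypothetical expectation the `location`
attribute takes precedence, so the selective matcher entry is ignored:

~~~~ {.java}
@ShouldMatchDataSet(location = "test3",
        withSelectiveMatcher = {
                @SelectiveMatcher(identifier = "one", location = "ignoredDataSet")
        })
~~~~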

You can also use the same approach for inserting data into the same
engine but into different databases. If you have one MongoDb instance
with two databases, you can write tests for both databases at the same
time. For example:

~~~~ {.java}
@Rule
public MongoDbRule remoteMongoDbRule1 = new MongoDbRule(mongoDb()
        .databaseName("test").connectionIdentifier("one").build(), this);

@Rule
public MongoDbRule remoteMongoDbRule2 = new MongoDbRule(mongoDb()
        .databaseName("test2").connectionIdentifier("two").build(), this);

@Test
@UsingDataSet(withSelectiveLocations = {
        @Selective(identifier = "one", locations = "json.test"),
        @Selective(identifier = "two", locations = "json3.test") },
        loadStrategy = LoadStrategyEnum.CLEAN_INSERT)
public void my_test() {...}
~~~~

Support for JSR-330
===================

**NoSQLUnit** supports two annotations from JSR-330, a.k.a. Dependency
Injection for Java: the @Inject and @Named annotations.

During test execution you may need to access the underlying class used
to load and assert data, in order to execute extra operations against
the backend. **NoSQLUnit** will inspect @Inject annotations of test
fields and try to inject its own driver into the attribute. For example,
in the case of MongoDb, a com.mongodb.Mongo instance will be injected.

~~~~ {.java}
@Rule
public MongoDbRule remoteMongoDbRule1 = new MongoDbRule(mongoDb()
        .databaseName("test").build(), this);

@Inject
private Mongo mongo;
~~~~

> **Warning**
>
> Note that in the [example](#advanced.injection) we are passing `this` as
> the second parameter to the Rule. This is only required in versions of JUnit prior to 4.11; in newer versions passing the `this` parameter is no longer required.

But if you are using more than one engine at the same time (see the
[chapter](#advanced.simultaneous-engine-title)), you need a way to
distinguish each connection. To fix this problem, you must use the
@Named annotation with the identifier given in the configuration
instance. For example:

~~~~ {.java}
@Rule
public MongoDbRule remoteMongoDbRule1 = new MongoDbRule(mongoDb()
        .databaseName("test").connectionIdentifier("one").build(), this);

@Rule
public MongoDbRule remoteMongoDbRule2 = new MongoDbRule(mongoDb()
        .databaseName("test2").connectionIdentifier("two").build(), this);

@Named("one")
@Inject
private Mongo mongo1;

@Named("two")
@Inject
private Mongo mongo2;
~~~~

There are some situations (mostly when using *Arquillian*) in which you want to inject the value managed by the container instead of the one managed by **NoSQLUnit**. To avoid an injection conflict, **NoSQLUnit** provides a special annotation called *@ByContainer*. By using it, the injection processor will leave the field untouched.

~~~~ {.java}
@Inject
@ByContainer
private Mongo mongo2;
~~~~

Spring Data
===========

With **NoSQLUnit** you can also write tests for `Spring Data` projects. You can read more in the following posts:

[Spring Data MongoDB](http://www.lordofthejars.com/2013/01/testing-spring-data-mongodb.html)

[Spring Data Neo4j](http://www.lordofthejars.com/2013/03/testing-spring-data-neo4j-applications.html)
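The basic pattern is the same as in plain tests: the **NoSQLUnit** rules
manage the database state while Spring wires the beans under test. The
following is a minimal sketch, assuming a Spring JUnit runner, a test
context file and a hypothetical `BookRepository` bean:

~~~~ {.java}
import static com.lordofthejars.nosqlunit.mongodb.InMemoryMongoDb.InMemoryMongoRuleBuilder.newInMemoryMongoDbRule;
import static com.lordofthejars.nosqlunit.mongodb.MongoDbRule.MongoDbRuleBuilder.newMongoDbRule;
import static org.hamcrest.CoreMatchers.is;
import static org.junit.Assert.assertThat;

import org.junit.ClassRule;
import org.junit.Rule;
import org.junit.Test;
import org.junit.runner.RunWith;
import org.springframework.beans.factory.annotation.Autowired;
import org.springframework.test.context.ContextConfiguration;
import org.springframework.test.context.junit4.SpringJUnit4ClassRunner;

import com.lordofthejars.nosqlunit.annotation.UsingDataSet;
import com.lordofthejars.nosqlunit.core.LoadStrategyEnum;
import com.lordofthejars.nosqlunit.mongodb.InMemoryMongoDb;
import com.lordofthejars.nosqlunit.mongodb.MongoDbRule;

@RunWith(SpringJUnit4ClassRunner.class)
@ContextConfiguration("classpath:application-context.xml")
public class WhenBooksAreManagedWithSpringData {

    @ClassRule
    public static InMemoryMongoDb inMemoryMongoDb = newInMemoryMongoDbRule().build();

    @Rule
    public MongoDbRule mongoDbRule = newMongoDbRule().defaultEmbeddedMongoDb("test");

    @Autowired
    private BookRepository bookRepository; // hypothetical Spring Data repository

    @Test
    @UsingDataSet(locations = "initialData.json", loadStrategy = LoadStrategyEnum.CLEAN_INSERT)
    public void books_should_be_readable_through_the_repository() {
        assertThat(bookRepository.count(), is(1L));
    }
}
~~~~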

Stay In Touch
=============

Stay in Touch
-------------

---------- ------------------------------------------------------------------
Email: asotobu at gmail.com
Blog:      [Lord Of The Jars](http://www.lordofthejars.com)
Twitter: @alexsotob
Github: [NoSQLUnit Github](https://github.com/lordofthejars/nosql-unit/)
Forum: [Forum](http://nosqlunit.1092832.n5.nabble.com)
Mailing: [Mailing List](http://nosqlunit.1093269.n5.nabble.com)
---------- ------------------------------------------------------------------