TransiStore-on-DropWizard

Version of TransiStore service to deploy on DropWizard

License	License The Apache Software License, Version 2.0
Categories	Categories DropWizard Container Microservices
GroupId	GroupId com.fasterxml.transistore
ArtifactId	ArtifactId transistore-dropwizard
Last Version	Last Version 0.9.9
Release Date	Release Date 11-Apr-2014
Type	Type jar
Description	Description TransiStore-on-DropWizard Version of TransiStore service to deploy on DropWizard
Project Organization	Project Organization FasterXML

Download transistore-dropwizard

Filename	Size
transistore-dropwizard-0.9.9.pom
transistore-dropwizard-0.9.9.jar	90 KB
transistore-dropwizard-0.9.9-sources.jar	84 KB
transistore-dropwizard-0.9.9-javadoc.jar	70 KB
Browse

How to add to project

Apache Maven

<!-- https://jarcasting.com/artifacts/com.fasterxml.transistore/transistore-dropwizard/ -->
<dependency>
    <groupId>com.fasterxml.transistore</groupId>
    <artifactId>transistore-dropwizard</artifactId>
    <version>0.9.9</version>
</dependency>

Gradle Groovy

// https://jarcasting.com/artifacts/com.fasterxml.transistore/transistore-dropwizard/
implementation 'com.fasterxml.transistore:transistore-dropwizard:0.9.9'

Gradle Kotlin

// https://jarcasting.com/artifacts/com.fasterxml.transistore/transistore-dropwizard/
implementation ("com.fasterxml.transistore:transistore-dropwizard:0.9.9")

Apache Buildr

'com.fasterxml.transistore:transistore-dropwizard:jar:0.9.9'

Apache Ivy

<dependency org="com.fasterxml.transistore" name="transistore-dropwizard" rev="0.9.9">
  <artifact name="transistore-dropwizard" type="jar" />
</dependency>

Groovy Grape

@Grapes(
@Grab(group='com.fasterxml.transistore', module='transistore-dropwizard', version='0.9.9')
)

Scala SBT

libraryDependencies += "com.fasterxml.transistore" % "transistore-dropwizard" % "0.9.9"

Leiningen

[com.fasterxml.transistore/transistore-dropwizard "0.9.9"]

Dependencies

compile (6)

Group / Artifact	Type	Version
com.fasterxml.transistore : transistore-service	jar	0.9.9
org.slf4j : slf4j-api	jar
com.fasterxml.storemate : storemate-store	jar
com.fasterxml.storemate : storemate-backend-bdb-je	jar
com.fasterxml.clustermate : clustermate-service	jar	0.9.26
com.fasterxml.clustermate : clustermate-dropwizard	jar	0.9.26

test (1)

Group / Artifact	Type	Version
junit : junit	jar	4.8.2

Project Modules

There are no modules declared in this project.

TransiStore

TransiStore is a distributed data store for temporary (time-bound, possibly but not necessarily transient) data, such as intermediate processing results (for Map/Reduce, Hadoop), staging area between high-volume producers and consumers (log processing and aggregation), or just as general-purpose store for data exchange.

TransiStore is built on ClusterMate platform, and serves as a sample system. Note that implementing storage system with different behavior, using ClusterMate (possibly using TransiStore as sample code) is highly encouraged: it is not meant as "The" storage system; although if it works as-is for your use case, all the better.

Basics

License

'TransiStore` (and all its dependencies) are licensed under Apache 2.0.

Functionality

TransiStore can be viewed as a distributed key/value ("BLOB") store which explicitly supports Key Range queries for primary keys. Stored entries have explicit time-to-live setting and service removes expired entries if they have not been explicitly deleted earlier. Clusters are horizontally scalable and allow addition/removal of storage nodes, without restarting other nodes.

The main limitation -- as of now, at least -- is that content is Write-Once; that is, entries are immutable after being added: they may however. be explicitly deleted (and if not, will expire as per time-to-live settings). While this is a limit that use cases must conform to, it greatly simplifies implementation and improves handling performance, as conflict resolution is simple to handle (due to minimal number of cases to resolve).

Underlying (per-node) storage

Storage layer comes from ClusterMate (which in turn builds on StoreMate project.

Here are some highlights (for more refer to StoreMate project) of storage system:

Pluggable backends: default implementation uses BDB-JE for local storage, but there is also experimental LevelDB backend.
Automatic on-the-fly (de)compression; negotiated using standard HTTP; supports multiple compression methods (client can pre-compress instead of server, or defer decompression)
Partial content queries (HTTP Range supported)
Key-range queries.

and additional features that ClusterMate provides are:

Peer-to-peer content synchronization used for on-going content synchronization, recovery, and bootstrapping of newly added nodes
Configurable redundancy (number of copies to store), with different client-controlled minimal required writes.
Client-configurable data expiration rates (per-entry time-to-live) to ensure that content will not live forever even if no explicit deletions are performed
Key partitioning to support cluster-wide key-range queries (routing by partition; queries within single partition) -- note: key structure fully configurable at ClusterMate level; TransiStore uses a simple partition + path key structure.

Configuration

Configuration is simple: it consists of a single JSON configuration file. Sample configuration files can be found from under sample/.

Why not X?

(where X may be "HBase", "Cassandra", "Voldemort", "Riak" or any of dozens of distributed key/value stores)

Features that separate TransiStore for most other "noSQL" key/value stores are ones listed earlier:

Can store LARGE entries (similar to S3): authors use it for storing 10 megabyte sized entries. It is designed for such usage -- you can even do partial GETs (with HTTP Range header)
Key-range queries: some noSQL systems (like HBase) support them, but most don't (Cassandra, Voldemort), at least with recommended set ups. TransiStore is designed to fully support them.
Automatic expiration of data: useful for intermediate results (Hadoop processing? Log aggregation?), weekly reports, or even some types of caching. You just specify maximum life-time for entries and system purges them as necessary
Simple configuration: single configuration file, used for all nodes; and even clients if you want.
Automatic node recovery: once a node has its configuration file, it is ready to start up and synchronize entries it needs -- no manual process or delicate coordination needed (NOTE: there will be need to build dashboards, tools to help track progress -- but foundational functionality works for actual syncing)

There are many more advanced features; but most of them are common within this class of servers (recovery, configurable number of copies required, statistics/metrics).

Documentation

Here are things you will probably want to read first:

Installing (my first TransiStore cluster): Build or download jar, find a config, start up!
Configuration: what do those sample configs contain?
Tools: Yes, we have small set of command-line tools too; accessed with tstore.sh wrapper (invoking ts-client jar)

and here's some more stuff:

Dependencies -- what is TransiStore built of?

FasterXML, LLC

Versions

Version
0.9.9 11-Apr-2014
0.9.8 04-Jan-2014
0.9.7 24-May-2013
0.9.5 01-May-2013
0.9.2 19-Mar-2013
0.9.0 13-Mar-2013
0.7.1 08-Mar-2013

TransiStore-on-DropWizard

License

Categories

GroupId

ArtifactId

Last Version

Release Date

Type

Description

Project Organization

Download transistore-dropwizard

How to add to project

Dependencies

compile (6)

test (1)

Project Modules

TransiStore

Basics

License

Functionality

Underlying (per-node) storage

Configuration

Why not X?

Documentation

FasterXML, LLC

Versions