phantom
=======
Reactive type-safe Scala DSL for Cassandra
To stay up-to-date with our latest releases and news, follow us on Twitter: @websudos.
If you use phantom, please consider adding your company to our list of adopters. Phantom is and will always be completely free and open source, but the more adopters our projects have, the more people from our company will actively work to make them better.
We publish phantom in two formats: stable releases and bleeding edge.

- The stable release is always available on Maven Central and is indicated by the badge at the top of this readme. The Maven Central badge points at the latest version.
- Intermediary releases are available through our managed Bintray repository at https://dl.bintray.com/websudos/oss-releases/. The latest version available on our Bintray repository is indicated by the Bintray badge at the top of this readme.
Check the badges at the top of this README for the latest version. The badges are automatically updated in real time, whereas this README isn't.
- Latest stable version: 1.8.9 (Maven Central)
- Bleeding edge: 1.8.12 (Websudos OSS releases on Bintray)
You will also need the default resolvers for Maven Central and the Typesafe releases. Phantom will never rely on snapshots or be published as a snapshot version; the bleeding edge is always subject to internal scrutiny before any release into the wild.
The Apache Cassandra version used for auto-embedding Cassandra during tests is: val cassandraVersion = "2.1.0-rc5". You will require JDK 7 to use Cassandra, otherwise you will get an error when phantom tries to start the embedded database. The recommended JDK is the Oracle variant.
- 1.8.0: A new QueryBuilder, written from the ground up, in idiomatic Scala
- 1.8.0: Added support for type-safe ALTER queries
- 1.8.0: Support for advanced CQL options
- 1.9.0: Type safe prepared statements
- 1.9.0: Automated Schema migrations
- 2.0.0: Type safe user defined types
- Breaking changes in DSL and connectors
- 1.9.0: Automated table creations
- 1.9.0: Automated table truncation
- 1.9.0: Big performance improvements
The 1.8.0 release constitutes a major re-working of a wide number of internal phantom primitives, including but not limited to a brand new Scala flavoured QueryBuilder with full support for all CQL 3 features and even some of the more "esoteric" options available in CQL. We went above and beyond to try and offer a tool that's comprehensive and doesn't miss out on any feature of the protocol, no matter how small.
If you are wondering what happened to 1.7.0, it was never publicly released: testing the new QueryBuilder entailed serious internal effort, and for such a drastic change we wanted to do as much as possible to eliminate bugs. Surely some will still be found, but hopefully very few, and with your help they will be very short lived.
Ditching the Java driver was not a question of code quality in the driver, but rather an opportunity to exploit the more advanced features of the Scala type system: preventing duplicate limits on queries using phantom types, stopping even more invalid queries from compiling, and switching to a fully immutable QueryBuilder that's more in tune with idiomatic Scala, as opposed to the Java-esque mutable alternative already existing in the Java driver.
import com.websudos.phantom.Implicits._ has now been renamed to import com.websudos.phantom.dsl._. The old import is still there but deprecated.
A natural question is why we resorted to such seemingly unimportant changes; the goal was to enforce the new implicit mechanism and offer a uniform importing experience across all modules. So you can have the series import com.websudos.phantom.dsl._, import com.websudos.phantom.thrift._, import com.websudos.phantom.testkit._ and so on, all identical, all using Scala package object definitions as intended.
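In practice the change is a one-line swap:

```scala
// Before (deprecated, but still available):
// import com.websudos.phantom.Implicits._

// After; the same one-line pattern applies to every module:
import com.websudos.phantom.dsl._
import com.websudos.phantom.thrift._
import com.websudos.phantom.testkit._
```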
Until now, our implementation of Cassandra primitives was based on the Datastax Java driver and on an Option based DSL. This made it hard to deal with parse errors at runtime, specifically in those situations where the DSL was unable to parse the required type from the Cassandra result, or in the simple case where null was returned for a non-optional column.
The core of row parsing, the Column[Table, Record, ValueType].apply(row: Row) method used to parse rows in a type safe manner, was written like this:

```scala
import com.datastax.driver.core.Row

def apply(row: Row): T = optional(row).getOrElse(throw new Exception("Couldn't parse things"))
```
With this approach, the original exception (the one that caused the parser to hit a null and subsequently return a None) was ignored.
With the new type-safe primitive interface, which no longer relies on the Datastax Java driver, we were also able to move from the Option based parsing mechanism to a Try based one, which now logs all parse errors unaltered, exactly as they are thrown, using the logger for the given table.
Internally, we are now using something like this:

```scala
import scala.util.{Failure, Success, Try}

def optional(r: Row): Try[T]

def apply(r: Row): T = optional(r) match {
  case Success(value) => value
  case Failure(ex) => {
    table.logger.error(ex.getMessage)
    throw ex
  }
}
```
The exception is now logged and propagated with no interference. We intercept it to provide consistent logging in the same table logger where you would naturally monitor for logs.
Play enumerators and Twitter ResultSpools have been removed from the default one, get, fetch and collect methods. You will have to explicitly call fetchEnumerator and fetchSpool if you want result throttling through async lazy iterators. This offers everyone a significant improvement in query performance: async iterators needed a lot of expensive "magic" to work properly, and you don't always need to fold over 100k records. That behaviour was implemented both as a means of showing off and for all-in-one loads like those the Spark - Cassandra connector performs, e.g. dumping C* data into HDFS or some other backup system. A big 60 - 70% gain should be expected.
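A short sketch of the difference, assuming a hypothetical Users table with a User record:

```scala
import scala.concurrent.Future

// Default behaviour: fetch eagerly loads the whole result set.
val eager: Future[Seq[User]] = Users.select.fetch

// Lazy, throttled iteration is still available, but only on explicit request:
val lazyResults = Users.select.fetchEnumerator
```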
Phantom connectors now require an implicit com.websudos.phantom.connectors.KeySpace to be defined. Instead of using a plain string, you just have to use KeySpace.apply, or simply: trait MyConnector extends Connector { implicit val keySpace = KeySpace("your_def") }. This change allows us to replace the existing connector model and vastly reduce the number of concurrent cluster connections required to perform operations on various keyspaces. Instead of the one-session-per-keyspace model, we can now successfully re-use the same session without even needing to switch, as phantom will use the full CQL reference syntax, e.g. SELECT FROM keyspace.table instead of SELECT FROM table.
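Spelled out, a minimal connector might look like this (a sketch: "my_app" is a placeholder keyspace name, and the Connector trait from the snippet above is assumed in scope):

```scala
import com.websudos.phantom.connectors.KeySpace

// One keyspace, one implicit KeySpace definition shared by all tables mixing this in.
trait MyAppConnector extends Connector {
  implicit val keySpace: KeySpace = KeySpace("my_app")
}
```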
An entirely new set of options has been enabled in the type safe DSL. You can now alter tables, specify advanced compressor behaviour and so forth, all from within phantom and with the guarantee of auto-completion and type safety. This was never possible before in phantom; from 1.8.0 onwards we feature full support for ALTER queries.
- Issues and questions
- Adopters
- Roadmap
- Tutorials on phantom and Cassandra
- Commercial support
- Using phantom in your project
- Phantom columns
- Data modeling with phantom
- Asynchronous iterators
- Batch statements
- Thrift integration
- Contributing to phantom
- Using GitFlow as a branching model
- Scala style guidelines for contributions
- Copyright
- User defined types
We are working closely around the latest features in the Datastax Java driver and Apache Cassandra 2.1 to offer a fully type safe DSL for user defined types. This feature is well in progress and you can expect to see it live roughly at the same time as the release of the Datastax 2.1 driver, planned for July 2014.
Some of the cool features include automatic schema generation, fully type safe referencing of fields and inner members of UDTs and fully type safe querying.
- Spark integration
Thanks to the recent partnership between Databricks and Datastax, Spark is getting a Cassandra facelift with a Datastax backed integration. We won't be slow to follow up with a type safe Scala variant of that integration, so you can enjoy the benefits of high power computation with Cassandra as backing storage, through the simple high power DSL we've gotten you used to.
- Prepared statements
By popular demand, a feature long overdue in phantom. The main reason for the delay is the underlying Java driver and the difficulty of guaranteeing type safety with prepared statements while keeping a nice DSL to get things done. That's not to say it's impossible; this will be released after the new QueryBuilder emerges.
- A new QueryBuilder (available as of 1.6.0)
- ZooKeeper support (available as of 1.1.0)
- Software development
- Remote contractors for hire
- Advanced Scala and Cassandra training
- Creating a ZooKeeper client and initialising it in due time.
- Fetching and parsing a sequence of Cassandra ports from ZooKeeper.
- Creating a Cluster configuration based on the sequence of Cassandra ports available in ZooKeeper.
- Creating an implicit session for queries to execute.
- Create a global method to initialise all your tables using phantom's auto-generation capability.
- Create a global method to cleanup and truncate your tables after tests finish executing.
- Create a root specification file that you plan to use for all your tests.
- Use sbt test to run the normal test suite, which should finish pretty quickly, within two minutes.
- Use sbt perf:test if you have a lot of time on your hands and you are debugging performance issues with the framework. This will take 40-50 minutes.
- Flavian Alexandru (@alexflav23) - maintainer
- Viktor Taranenko (@viktortnk)
- Benjamin Edwards (@benjumanji)
- Jens Halm (@jenshalm)
- Bartosz Jankiewicz (@bjankie1)
- Eugene Zhulenev (@ezhulenev)
- Stephen Samuel (@sksamuel)
- Tomasz Perek (@tperek)
- Evan Chan (@evanfchan)
- When you submit a "Pull request" we require all changes to be squashed.
- We never merge more than one commit at a time. All the n commits on your feature branch must be squashed.
- We won't look at the pull request until Travis CI says the tests pass, so make sure tests go well.
- Blocking when you don't have to. It just makes our eyes hurt when we see useless blocking.
- Testing should be thread safe and fully async; use ParallelTestExecution if you want to show off.
- Writing tests should use the pre-existing tools; they bring in EmbeddedCassandra, ZooKeeper and other niceties, allowing us to run multi-datacenter tests.
- Use the common patterns you already see here, we've done a lot of work to make it easy.
- Don't randomly import stuff. We are very big on alphabetized clean imports.
- Tests must pass on both the Oracle and OpenJDK JVM implementations. The only sensitive bit is the Scala reflection mechanism used to detect columns.
We love Cassandra to bits and use it in every bit of our stack. Phantom makes it super trivial for Scala users to embrace Cassandra.
Cassandra is highly scalable and it's by far the most powerful database technology available, open source or otherwise.
Phantom is built on top of the Datastax Java Driver, which does most of the heavy lifting.
If you're completely new to Cassandra, a much better place to start is the Datastax Introduction to Cassandra. An even better introduction is available on [our blog](http://blog.websudos.com/category/nosql/cassandra/), where we have a full series of introductory posts to Cassandra with phantom.
We are very happy to help implement missing features in phantom, answer questions about phantom, and occasionally help you out with Cassandra questions! Please use GitHub for any issues or bug reports.
This is a list of companies that have embraced phantom as part of their technology stack and are using it in production environments.
While dates are not fixed, we will use this list to tell you about our plans for the future. If you have great ideas about what could benefit all phantom adopters, please get in touch. We are very happy and eager to listen.
We, the people behind phantom, run a software development house specialised in Scala and NoSQL. If you are after enterprise grade training or support for using phantom, Websudos is here to help!
We offer a comprehensive range of elite Scala development services, including but not limited to:
We are big fans of open source and we will open source every project we can! To read more about our OSS efforts, click here.
The resolvers needed for Phantom are the Typesafe defaults, Sonatype, Twitter and our very own. The below list should make sure you have no dependency resolution errors.
```scala
resolvers ++= Seq(
  "Typesafe repository snapshots" at "http://repo.typesafe.com/typesafe/snapshots/",
  "Typesafe repository releases" at "http://repo.typesafe.com/typesafe/releases/",
  "Sonatype repo" at "https://oss.sonatype.org/content/groups/scala-tools/",
  "Sonatype releases" at "https://oss.sonatype.org/content/repositories/releases",
  "Sonatype snapshots" at "https://oss.sonatype.org/content/repositories/snapshots",
  "Sonatype staging" at "http://oss.sonatype.org/content/repositories/staging",
  "Java.net Maven2 Repository" at "http://download.java.net/maven/2/",
  "Twitter Repository" at "http://maven.twttr.com",
  "Websudos releases" at "http://maven.websudos.co.uk/ext-release-local"
)
```
For most things, all you need is phantom-dsl and phantom-testkit. Read on for information on the other modules.
```scala
libraryDependencies ++= Seq(
  "com.websudos" %% "phantom-dsl" % phantomVersion,
  "com.websudos" %% "phantom-testkit" % phantomVersion
)
```
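These snippets assume a phantomVersion value in your build definition; a minimal sketch, pinned to the stable release listed at the top of this readme:

```scala
// Pin to whichever release you use; 1.8.9 is the stable Maven Central version at the time of writing.
val phantomVersion = "1.8.9"
```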
The full list of available modules is:
```scala
libraryDependencies ++= Seq(
  "com.websudos" %% "phantom-dsl" % phantomVersion,
  "com.websudos" %% "phantom-example" % phantomVersion,
  "com.websudos" %% "phantom-scalatra" % phantomVersion,
  "com.websudos" %% "phantom-spark" % phantomVersion,
  "com.websudos" %% "phantom-thrift" % phantomVersion,
  "com.websudos" %% "phantom-testkit" % phantomVersion,
  "com.websudos" %% "phantom-udt" % phantomVersion,
  "com.websudos" %% "phantom-zookeeper" % phantomVersion,
  "com.websudos" %% "phantom-sbt" % phantomVersion
)
```
If you include phantom-zookeeper, make sure to add the following resolver:

```scala
resolvers += "twitter-repo" at "http://maven.twttr.com"
```
This is the list of available columns and how they map to C* data types. It also includes the static columns newly introduced in C* 2.0.6. The type of a static column can be any of the allowed primitive Cassandra types; phantom won't let you mix in a non-primitive via implicit magic.
phantom columns | Java/Scala type | Cassandra type |
---|---|---|
BlobColumn | java.nio.ByteBuffer | blob |
BigDecimalColumn | scala.math.BigDecimal | decimal |
BigIntColumn | scala.math.BigInt | varint |
BooleanColumn | scala.Boolean | boolean |
DateColumn | java.util.Date | timestamp |
DateTimeColumn | org.joda.time.DateTime | timestamp |
DoubleColumn | scala.Double | double |
EnumColumn | scala.Enumeration | text |
FloatColumn | scala.Float | float |
IntColumn | scala.Int | int |
InetAddressColumn | java.net.InetAddress | inet |
LongColumn | scala.Long | bigint |
StringColumn | java.lang.String | text |
UUIDColumn | java.util.UUID | uuid |
TimeUUIDColumn | java.util.UUID | timeuuid |
CounterColumn | scala.Long | counter |
StaticColumn<type> | <type> | <type> static |
Optional columns allow you to set a column to null or None. Use them when you really want something to be optional. The outcome is that instead of a T you get an Option[T], and you can match, fold, flatMap and map on it. The Optional part is handled at the DSL level; it's not translated to Cassandra in any way. A short consumption sketch follows the table below.
phantom columns | Java/Scala type | Cassandra columns |
---|---|---|
OptionalBlobColumn | Option[java.nio.ByteBuffer] | blob |
OptionalBigDecimalColumn | Option[scala.math.BigDecimal] | decimal |
OptionalBigIntColumn | Option[scala.math.BigInt] | varint |
OptionalBooleanColumn | Option[scala.Boolean] | boolean |
OptionalDateColumn | Option[java.util.Date] | timestamp |
OptionalDateTimeColumn | Option[org.joda.time.DateTime] | timestamp |
OptionalDoubleColumn | Option[scala.Double] | double |
OptionalEnumColumn | Option[scala.Enumeration] | text |
OptionalFloatColumn | Option[scala.Float] | float |
OptionalIntColumn | Option[scala.Int] | int |
OptionalInetAddressColumn | Option[java.net.InetAddress] | inet |
OptionalLongColumn | Option[scala.Long] | bigint |
OptionalStringColumn | Option[java.lang.String] | text |
OptionalUUIDColumn | Option[java.util.UUID] | uuid |
OptionalTimeUUIDColumn | Option[java.util.UUID] | timeuuid |
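For example, consuming an optional column value from a fetched record; a sketch where record is a placeholder for a fetched row of the ExampleModel type defined later in this readme (its test field is an Option[Int]):

```scala
// A plain column would yield an Int; an optional column yields an Option[Int].
val maybeTest: Option[Int] = record.test

// All the usual Option combinators apply.
val doubled: Int = maybeTest.map(_ * 2).getOrElse(0)
```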
Cassandra collections do not allow custom data types. Storing JSON as a string is possible, but it's still a text column as far as Cassandra is concerned. The type in the below example is always a default C* type. JSON columns require you to define a toJson and a fromJson method, telling phantom how to go from a String to the type you need and back. Phantom makes no assumptions as to which JSON library you are using, although we have tested with lift-json and play-json. Examples of how to use JSON columns can be found in JsonColumnTest.scala, and a short sketch follows the table below.
phantom columns | Cassandra columns |
---|---|
ListColumn.<type> | list<type> |
SetColumn.<type> | set<type> |
MapColumn.<type, type> | map<type, type> |
JsonColumn.<type> | text |
JsonListColumn.<type> | list<text> |
JsonSetColumn.<type> | set<text> |
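A rough sketch of a JSON column definition, using play-json for serialization; the exact toJson/fromJson override signatures below are inferred from the prose above and should be treated as illustrative:

```scala
import play.api.libs.json.Json

case class Profile(firstName: String, lastName: String)

object Profile {
  implicit val format = Json.format[Profile]
}

// Declared inside a CassandraTable body, like any other column;
// type parameters follow the same (table, record, value) pattern as MapColumn.
object profile extends JsonColumn[ExampleRecord, ExampleModel, Profile](this) {
  // Tell phantom how to go from String to Profile and back.
  override def fromJson(obj: String): Profile = Json.parse(obj).as[Profile]
  override def toJson(obj: Profile): String = Json.stringify(Json.toJson(obj))
}
```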
Phantom uses a specific set of traits to enforce more advanced Cassandra limitations and schema rules at compile time. Instead of waiting for Cassandra to tell you you've done bad things, phantom won't let you compile them, saving you a lot of time. The error messages you get when your model is off with respect to Cassandra rules are not particularly helpful, and we are working on a better builder to allow for better error messages. Until then, consider the following example:
```scala
import com.websudos.phantom.dsl._

case class Student(id: UUID, name: String)

class Students extends CassandraTable[Students, Student] {
  object id extends UUIDColumn(this) with PartitionKey[UUID]
  object name extends StringColumn(this)

  def fromRow(row: Row): Student = {
    Student(id(row), name(row))
  }
}

object Students extends Students with Connector {
  /**
   * The below code will result in a compilation error phantom produces by design.
   * This behaviour is not only correct with respect to CQL but also intended by the implementation.
   *
   * The reason it won't compile is that the "name" column is not an index in the "Students" table,
   * which means using "name" in a "where" clause is invalid CQL.
   * Phantom prevents you from running most invalid queries by simply giving you a compile time error instead.
   */
  def getByName(name: String): Future[Option[Student]] = {
    select.where(_.name eqs name).one()
  }
}
```
The compilation error message for the above looks something like this:

value eqs is not a member of object x$9.name

This might seem overly mysterious at first, but the logic is simple: there is no implicit conversion in scope to convert your non-indexed column to a QueryColumn. If you don't have an index, you can't query.
You can, however, still use a non-primary column such as name in a conditional update via onlyIf:

```scala
Students.update.where(_.id eqs someId).onlyIf(_.name is "test")
```
This is the default partitioning key of the table, telling Cassandra how to divide data into partitions and store them accordingly. You must define at least one partition key for a table. Phantom will gently remind you of this with a fatal error.
If you use a single partition key, the PartitionKey will always be the first key in the schema, followed by the PrimaryKey columns. It looks like this in CQL: PRIMARY KEY (your_partition_key, primary_key_1, primary_key_2).
Using more than one PartitionKey[T] in your schema definition will output a composite key in Cassandra: PRIMARY KEY ((your_partition_key_1, your_partition_key_2), primary_key_1, primary_key_2).
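A sketch of what that looks like in a table definition (column names are placeholders, declared inside a CassandraTable body):

```scala
// Two PartitionKey mixins yield PRIMARY KEY ((id, bucket), ...) in the generated schema.
object id extends UUIDColumn(this) with PartitionKey[UUID]
object bucket extends IntColumn(this) with PartitionKey[Int]
```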
As its name says, using this will mark a column as a PrimaryKey. Using multiple such columns will result in a compound key. The first key in the schema, the PartitionKey, is used to partition data. Phantom will force you to always define a PartitionKey so you don't forget about how your data is partitioned; we also use this DSL restriction because we hope to do more clever things with it in the future. A compound key in C* looks like this: PRIMARY KEY (partition_key, primary_key_1, primary_key_2).
Before you add too many of these, remember they all have to go into a where clause. You can only query with a full primary key, even if it's compound. Phantom can't yet give you a compile time error for this, but Cassandra will give you a runtime one.
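For instance, a sketch against a hypothetical table with compound key (id, timestamp, name); someId and someTimestamp are placeholder values:

```scala
// Every component of the primary key must be bound for the query to be valid.
ExampleRecord3.select
  .where(_.id eqs someId)
  .and(_.timestamp eqs someTimestamp)
  .and(_.name eqs "some name")
  .one()
```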
This is a SecondaryIndex in Cassandra. It can make more columns queryable, but it's not exactly high performance, so it's generally best to avoid it; we implemented it to show off what good guys we are.
When you mix in Index[T] on a column, phantom will let you use it in a where clause. However, don't forget to allowFiltering for such queries, otherwise C* will give you an error.
This can be used with either java.util.Date or org.joda.time.DateTime. It tells Cassandra to store records in a certain order based on this field. An example might be:

```scala
object timestamp extends DateTimeColumn(this) with ClusteringOrder[DateTime] with Ascending
```

To fully define a clustering column, you MUST also mix in either Ascending or Descending to indicate the sorting order.
These columns are especially useful if you are building Thrift services. They are deeply integrated with Twitter Scrooge and relevant to the Twitter ecosystem (Finagle, Zipkin, Storm, etc.).
They are available via the phantom-thrift module, and you need to import the Thrift package to get all the necessary types into scope:

```scala
import com.websudos.phantom.thrift._
```

In the below scenario, the Cassandra type is always text, and the type you need to pass to the column is a Thrift struct, specifically com.twitter.scrooge.ThriftStruct. Phantom will use a CompactThriftSerializer, store the record as a binary string, and re-parse it on fetch. Thrift serialization and de-serialization is extremely fast, so you don't need to worry about speed or performance overhead. You generally use these to store collections (a small number of items), not big things.
phantom columns | Cassandra columns |
---|---|
ThriftColumn.<type> | text |
ThriftListColumn.<type> | list<text> |
ThriftSetColumn.<type> | set<text> |
ThriftMapColumn.<type, type> | map<text, text> |
```scala
import com.websudos.phantom.dsl._

// The model's field types mirror the column types: UUIDColumn -> UUID, DateTimeColumn -> DateTime.
case class ExampleModel(
  id: UUID,
  name: String,
  props: Map[String, String],
  timestamp: DateTime,
  test: Option[Int]
)

sealed class ExampleRecord extends CassandraTable[ExampleRecord, ExampleModel] {
  object id extends UUIDColumn(this) with PartitionKey[UUID]
  object timestamp extends DateTimeColumn(this) with ClusteringOrder[DateTime] with Ascending
  object name extends StringColumn(this)
  object props extends MapColumn[ExampleRecord, ExampleModel, String, String](this)
  object test extends OptionalIntColumn(this)

  def fromRow(row: Row): ExampleModel = {
    ExampleModel(id(row), name(row), props(row), timestamp(row), test(row))
  }
}
```
The query syntax is inspired by the Foursquare Rogue library and aims to replicate CQL 3 as much as possible.
Phantom works with both Scala Futures and Twitter Futures as first class citizens.
The full list can be found in CQLQuery.scala
Method name | Description |
---|---|
tracing_= | The Cassandra utility method. Enables or disables tracing. |
queryString | Get the output CQL 3 query of a phantom query. |
consistencyLevel | Retrieves the consistency level in use. |
consistencyLevel_= | Sets the consistency level to use. |
retryPolicy | Retrieves the RetryPolicy in use. |
retryPolicy_= | Sets the RetryPolicy to use. |
serialConsistencyLevel | Retrieves the serial consistency level in use. |
serialConsistencyLevel_= | Sets the serial consistency level to use. |
forceNoValues_= | Sets whether values are serialized into the generated CQL query string. |
routingKey | Retrieves the Routing Key as a ByteBuffer. |
Method name | Description |
---|---|
where | The WHERE clause in CQL |
and | Chains several clauses, creating a WHERE ... AND query |
orderBy | Adds an ORDER BY column_name to the query |
allowFiltering | Allows Cassandra to filter records in memory. This is an expensive operation. |
limit | Sets the maximum number of records to retrieve. |
Select queries are very straightforward and enforce most limitations at compile time.
Operator name | Description |
---|---|
eqs | The "equals" operator. Will match if the objects are equal |
in | The "in" operator. Will match if the object is found in the list of arguments |
gt | The "greater than" operator. Will match if the record is greater than the argument and exists |
gte | The "greater than or equals" operator. Will match if the record is greater than or equal to the argument and exists |
lt | The "lower than" operator. Will match if the record is less than the argument and exists |
lte | The "lower than or equals" operator. Will match if the record is less than or equal to the argument and exists |
All partial select queries will return Tuples and are therefore limited to 22 fields. We haven't yet bothered to add more than 10 fields in the select, but you can always do a Pull Request. The file you are looking for is here. The 22 field limitation will change in Scala 2.11 and phantom will be updated once cross version compilation is enabled.
```scala
def getNameById(id: UUID): Future[Option[String]] = {
  ExampleRecord.select(_.name).where(_.id eqs id).one()
}

def getNameAndPropsById(id: UUID): Future[Option[(String, Map[String, String])]] = {
  ExampleRecord.select(_.name, _.props).where(_.id eqs id).one()
}
```
Method name | Description |
---|---|
value | A type safe insert query builder. Throws an error for null values. |
valueOrNull | Accepts a null without throwing an error. |
ttl | Sets the "time to live" for the record. |
Method name | Description |
---|---|
where | The WHERE clause in CQL |
and | Chains several clauses, creating a WHERE ... AND query |
modify | The actual update query builder |
onlyIf | Additional update condition, used on non-primary columns |
Example:

```scala
ExampleRecord.update
  .where(_.id eqs myUuid)
  .modify(_.name setTo "Barack Obama")
  .and(_.props put ("title" -> "POTUS"))
  .future()
```
Method name | Description |
---|---|
where | The WHERE clause in CQL |
and | Chains several clauses, creating a WHERE ... AND query |
Delete queries are a very simple way to either delete a row or set a column to null. For instance:

```scala
BasicTable.update.where(_.id eqs someId).modify(_.someSet setTo Set.empty[String])
// is actually equivalent to
BasicTable.delete(_.someSet).where(_.id eqs someId)
```
Phantom offers a dual query API based on Scala concurrency primitives, which makes it trivial to use phantom in most known frameworks, such as Play!, Spray, Akka, Scruffy, Lift, and many others. Integration is easily achievable: all you have to do is use the Scala API methods and you get out-of-the-box integration.
Phantom also offers another API based on Twitter's concurrency primitives. This is because internally we rely very heavily on the Twitter eco-system; it's also why phantom offers Finagle-Thrift support out of the box and integrates with Twitter Scrooge. It fits in perfectly with applications powered by Finagle RPC, Zipkin, Thrift, Ostrich, Aurora, Mesos, and the rest of the Twitter lot.
Method name | Description | Scala result type |
---|---|---|
future | Executes a command and returns a ResultSet. Useful when you don't need to return a value. | scala.concurrent.Future[ResultSet] |
execute | Executes a command and returns a ResultSet. Useful when you don't need to return a value. | com.twitter.util.Future[ResultSet] |
one | Executes a command and returns an Option[T]. Use this when you are selecting and you only need one value. Adds LIMIT 1 to the CQL query. | scala.concurrent.Future[Option[Record]] |
get | Executes a command and returns an Option[T]. Use this when you are selecting and you only need one value. Adds LIMIT 1 to the CQL query. | com.twitter.util.Future[Option[Record]] |
fetch | Returns a sequence of matches. Use when you expect more than one match. | scala.concurrent.Future[Seq[Record]] |
collect | Returns a sequence of matches. Use when you expect more than one match. | com.twitter.util.Future[Seq[Record]] |
fetchSpool | Useful when you need the underlying result spool. | com.twitter.concurrent.Spool[Record] |
fetchEnumerator | Useful when you need the underlying Play based enumerator. | play.api.libs.iteratee.Enumerator[Record] |
Phantom offers a dual asynchronous Future API for the completion of tasks: scala.concurrent.Future and com.twitter.util.Future. However, the concurrency primitives are all based on Google Guava executors and listening decorators; the Future API is just for the convenience of users.
The Scala Future methods are:

```scala
ExampleRecord.select.one()           // When you only want to select one record
ExampleRecord.update.where(_.name eqs name).modify(_.name setTo "someOtherName").future() // When you don't care about the return type
ExampleRecord.select.fetchEnumerator // When you need an Enumerator
ExampleRecord.select.fetch           // When you want to fetch a Seq[Record]
```
```scala
import java.util.UUID
import scala.concurrent.ExecutionContext.Implicits.global
import scala.concurrent.Future

object ExampleRecord extends ExampleRecord {
  override val tableName = "examplerecord"

  // Define a session, a normal Datastax cluster connection.
  implicit val session = SomeCassandraClient.session

  def getRecordsByName(name: String): Future[Seq[ExampleModel]] = {
    ExampleRecord.select.where(_.name eqs name).fetch
  }

  def getOneRecordByName(name: String, someId: UUID): Future[Option[ExampleModel]] = {
    ExampleRecord.select.where(_.name eqs name).and(_.id eqs someId).one()
  }
}
```
Phantom doesn't depend on Finagle for this; we simply use "com.twitter" %% "util-core" % version to return a com.twitter.util.Future. However, the concurrency primitives are all based on Google Guava executors and listening decorators; the Future API is just for the convenience of users.
```scala
ExampleRecord.select.get()       // When you only want to select one record
ExampleRecord.update.where(_.name eqs name).modify(_.name setTo "someOtherName").execute() // When you don't care about the return type
ExampleRecord.select.fetchSpool  // When you need a Spool
ExampleRecord.select.collect     // When you want to fetch a Seq[Record]
```
```scala
import java.util.UUID
import com.twitter.util.Future

object ExampleRecord extends ExampleRecord {
  override val tableName = "examplerecord"

  // Define a session, a normal Datastax cluster connection.
  implicit val session = SomeCassandraClient.session

  def getRecordsByName(name: String): Future[Seq[ExampleModel]] = {
    ExampleRecord.select.where(_.name eqs name).collect
  }

  def getOneRecordByName(name: String, someId: UUID): Future[Option[ExampleModel]] = {
    ExampleRecord.select.where(_.name eqs name).and(_.id eqs someId).get()
  }
}
```
Based on the above list of columns, phantom supports CQL 3 modify operations for the CQL 3 collections: list, set and map. All operators are available in an update query, following this pattern:

```scala
ExampleRecord.update.where(_.id eqs someId).modify(_.someList $OPERATOR $args).future()
```

Examples in ListOperatorsTest.scala; a filled-in example follows the operators table below.
Name | Description |
---|---|
prepend | Adds an item to the head of the list |
prependAll | Adds multiple items to the head of the list |
append | Adds an item to the tail of the list |
appendAll | Adds multiple items to the tail of the list |
discard | Removes the given item from the list |
discardAll | Removes all given items from the list |
setIdx | Updates a specific index in the list |
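For example, filling in the $OPERATOR template above with append; someList is a hypothetical ListColumn[String] and someId is a placeholder:

```scala
ExampleRecord.update.where(_.id eqs someId).modify(_.someList append "new item").future()
```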
Sets have a better performance than lists, as the Cassandra documentation suggests. Examples in SetOperationsTest.scala.
Name | Description |
---|---|
add | Adds an item to the set |
addAll | Adds multiple items to the set |
remove | Removes the given item from the set |
removeAll | Removes all given items from the set |
Both the key and value types of a Map must be Cassandra primitives. Examples in MapOperationsTest.scala.
Name | Description |
---|---|
put | Adds a (key -> value) pair to the map |
putAll | Adds multiple (key -> value) pairs to the map |
Replication strategies and more advanced features are not yet available in phantom, but CQL 3 table schemas are automatically generated from the Scala code. To create a schema in Cassandra from a table definition:

```scala
import scala.concurrent.Await
import scala.concurrent.duration._

Await.result(ExampleRecord.create().future(), 5000 millis)
```
Of course, you don't have to block unless you want to.
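A non-blocking sketch of the same schema creation, mapping over the returned future instead of awaiting it:

```scala
import scala.concurrent.ExecutionContext.Implicits.global

ExampleRecord.create().future().map { _ =>
  println("Schema created") // the schema now exists; safe to start querying
}
```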
```scala
import scala.concurrent.Await
import scala.concurrent.duration._
import com.websudos.phantom.dsl._

sealed class ExampleRecord2 extends CassandraTable[ExampleRecord2, ExampleModel] {
  object id extends UUIDColumn(this) with PartitionKey[UUID]
  object order_id extends LongColumn(this) with ClusteringOrder[Long] with Descending
  object timestamp extends DateTimeColumn(this)
  object name extends StringColumn(this)
  object props extends MapColumn[ExampleRecord2, ExampleModel, String, String](this)
  object test extends OptionalIntColumn(this)

  override def fromRow(row: Row): ExampleModel = {
    ExampleModel(id(row), name(row), props(row), timestamp(row), test(row))
  }
}
```
```scala
val orderedResult = Await.result(Articles.select.where(_.id gtToken one.get.id).fetch, 5000 millis)
```
Operator name | Description |
---|---|
eqsToken | The "equals" operator. Will match if the objects are equal |
gtToken | The "greater than" operator. Will match a record that is greater than the argument |
gteToken | The "greater than or equals" operator. Will match a record that is greater than or equal to the argument |
ltToken | The "lower than" operator. Will match a record that is less than the argument and exists |
lteToken | The "lower than or equals" operator. Will match a record that is less than or equal to the argument |
For more details on how to use Cassandra partition tokens, see SkipRecordsByToken.scala
Phantom supports Cassandra time series. To use them, simply mix in com.websudos.phantom.keys.ClusteringOrder and either Ascending or Descending. Restrictions are enforced at compile time.
```scala
import com.websudos.phantom.dsl._

sealed class ExampleRecord3 extends CassandraTable[ExampleRecord3, ExampleModel] {
  object id extends UUIDColumn(this) with PartitionKey[UUID]
  object timestamp extends DateTimeColumn(this) with ClusteringOrder[DateTime] with Ascending
  object name extends StringColumn(this)
  object props extends MapColumn[ExampleRecord3, ExampleModel, String, String](this)
  object test extends OptionalIntColumn(this)

  override def fromRow(row: Row): ExampleModel = {
    ExampleModel(id(row), name(row), props(row), timestamp(row), test(row))
  }
}
```
Automatic schema generation can do all the setup for you.
Phantom also supports using compound keys out of the box. The schema can once again be auto-generated. A table can have only one PartitionKey but several PrimaryKey definitions. Phantom will use these keys to build a compound value. Example scenario, with the compound key (id, timestamp, name):
```scala
import com.websudos.phantom.dsl._

sealed class ExampleRecord3 extends CassandraTable[ExampleRecord3, ExampleModel] {
  object id extends UUIDColumn(this) with PartitionKey[UUID]
  object order_id extends LongColumn(this) with ClusteringOrder[Long] with Descending
  object timestamp extends DateTimeColumn(this) with PrimaryKey[DateTime]
  object name extends StringColumn(this) with PrimaryKey[String]
  object props extends MapColumn[ExampleRecord3, ExampleModel, String, String](this)
  object test extends OptionalIntColumn(this)

  override def fromRow(row: Row): ExampleModel = {
    ExampleModel(id(row), name(row), props(row), timestamp(row), test(row))
  }
}
```
When you want to use a column in a where clause, you need an index on it. Cassandra data modeling is beyond the scope of this writing, but phantom offers com.websudos.phantom.keys.Index to enable querying. The CQL 3 schema for secondary indexes can also be auto-generated with ExampleRecord4.create().
SELECT is the only query you can perform with an Index column. This is a Cassandra limitation. The relevant tests are found here.
```scala
import com.websudos.phantom.dsl._

sealed class ExampleRecord4 extends CassandraTable[ExampleRecord4, ExampleModel] {
  object id extends UUIDColumn(this) with PartitionKey[UUID]
  object order_id extends LongColumn(this) with ClusteringOrder[Long] with Descending
  object timestamp extends DateTimeColumn(this) with Index[DateTime]
  object name extends StringColumn(this) with Index[String]
  object props extends MapColumn[ExampleRecord4, ExampleModel, String, String](this)
  object test extends OptionalIntColumn(this)

  override def fromRow(row: Row): ExampleModel = {
    ExampleModel(id(row), name(row), props(row), timestamp(row), test(row))
  }
}
```
Phantom comes packed with asynchronous lazy iterators for CQL rows to help you deal with billions of records. Phantom iterators are based on Play iteratees, with a very lightweight integration. The functionality is identical with respect to asynchronous, lazy behaviour and available methods. For more on this, see this Play tutorial.
Usage is trivial. If you want to use slice, take or drop with iterators, the partitioner needs to be ordered.
```scala
import scala.concurrent.Future
import com.websudos.phantom.dsl._

sealed class ExampleRecord3 extends CassandraTable[ExampleRecord3, ExampleModel] {
  object id extends UUIDColumn(this) with PartitionKey[UUID]
  object order_id extends LongColumn(this) with ClusteringOrder[Long] with Descending
  object timestamp extends DateTimeColumn(this) with PrimaryKey[DateTime]
  object name extends StringColumn(this) with PrimaryKey[String]
  object props extends MapColumn[ExampleRecord3, ExampleModel, String, String](this)
  object test extends OptionalIntColumn(this)

  override def fromRow(row: Row): ExampleModel = {
    ExampleModel(id(row), name(row), props(row), timestamp(row), test(row))
  }
}

object ExampleRecord3 extends ExampleRecord3 {
  def getRecords(start: Int, limit: Int): Future[Set[ExampleModel]] = {
    select.fetchEnumerator.slice(start, limit).collect
  }
}
```
Phantom also brings in support for batch statements. To use them, see iteratee/BigTest.scala.
We have tested with 10,000 statements per batch, and 1000 batches processed simultaneously. Before you run the test, beware that it takes ~40 minutes.
Batches use lazy iterators and daisy chain them to offer thread safe behaviour. They are not memory intensive and you can expect consistent processing speed even with 1,000,000 statements per batch.
Batches are immutable and adding a new record will result in a new Batch, just like most things Scala, so be careful to chain the calls.
Phantom also supports COUNTER batch updates and UNLOGGED batch updates.
```scala
import com.websudos.phantom.dsl._

Batch.logged
  .add(ExampleRecord.update.where(_.id eqs someId).modify(_.name setTo "blabla"))
  .add(ExampleRecord.update.where(_.id eqs someOtherId).modify(_.name setTo "blabla2"))
  .future()
```
```scala
import com.websudos.phantom.dsl._

Batch.counter
  .add(ExampleRecord.update.where(_.id eqs someId).modify(_.someCounter increment 500L))
  .add(ExampleRecord.update.where(_.id eqs someOtherId).modify(_.someCounter decrement 300L))
  .future()
```
```scala
import com.websudos.phantom.dsl._

Batch.unlogged
  .add(ExampleRecord.update.where(_.id eqs someId).modify(_.name setTo "blabla"))
  .add(ExampleRecord.update.where(_.id eqs someOtherId).modify(_.name setTo "blabla2"))
  .future()
```
We use Apache Thrift extensively for our backend services. Phantom is very easy to integrate with Thrift models and uses Twitter Scrooge to compile them. Thrift integration is optional and available via "com.websudos" %% "phantom-thrift" % phantomVersion.
```thrift
namespace java com.websudos.phantom.sample.ExampleModel

struct ExampleModel {
  1: required i32 id,
  2: required string name,
  3: required map<string, string> props,
  4: required i32 timestamp,
  5: optional i32 test
}
```
If you have never heard of Apache ZooKeeper before, a much better place to start is here. Phantom offers a complete set of features for ZooKeeper integration using the finagle-zookeeper project.
Using a set of conventions, phantom can automate the entire process of using ZooKeeper in a distributed environment, dealing with a large series of concerns for you (the list at the top of this readme). The entire process is automated, with a series of sensible defaults available; more details on the default implementations are available below. Bottom line: if you want to go custom, you may override at will. If you just want to get something working as fast as possible, phantom-zookeeper can do everything for you.
This implementation is a very simple way to connect to a running Cassandra node. It does not use ZooKeeper and it's not really intended for multi-node testing or connections, but sometimes you just want to get things working immediately.
The implementation details are available here; without further ado, this connector will attempt to connect to a local Cassandra node, either embedded or not.
Inside Websudos, our port convention is 9042 for local Cassandra and 9142 for embedded Cassandra. This is reflected in our cassandra.yaml configuration files. Overriding this is quite simple, although you will need to create your own pair of manager and connector.
The default implementation expects Cassandra IPs to be listed in a sequence of host:port combinations, with : as the separator literal. It also expects the default ZooKeeper path for Cassandra ports to be /cassandra, and the sequence of ports should look like this:

host1:port1, host2:port2, host3:port3, host4:port4

Phantom will fetch the data found on the /cassandra path on the ZooKeeper master, attempt to parse all host:port pairs to a Seq[InetSocketAddress], and build a com.datastax.driver.core.Cluster using the sequence of addresses. Using that Cluster, phantom will spawn an implicit session: com.datastax.driver.core.Session. This session is the execution context of all queries inside a table definition. The DefaultZooKeeperManager, found here, will do all the plumbing work for you. More details on the internals are available here.
Naturally, no job is considered truly done without the testing automation provided out of the box. This is exactly what we tried to achieve with the testing utilities, giving you simple, easily extensible, yet highly sensible defaults. We wanted something that works for most things most of the time with zero integration work on your behalf, while still allowing you to go as custom as you please if the scenario warrants it.
With that design philosophy in mind, we've created two kinds of tests. One runs with a SimpleCassandraConnector (implementation found here), where the testing utilities will auto-spawn an embedded Cassandra database with the right version and the right settings, run all the tests and clean up after the tests are done.
The other, more complex implementation targets users who want to use phantom/Cassandra in a distributed environment. This is an easy way to automate multi-DC or multi-cluster tests via service discovery with Apache ZooKeeper; more details are available above.
There are four core implementations available:
Name | Description | ZooKeeper support | Auto-embedding support |
---|---|---|---|
CassandraFlatSpec | Simple FlatSpec trait mixin, based on org.scalatest.FlatSpec | No | Yes |
CassandraFeatureSpec | Simple FeatureSpec trait mixin, based on org.scalatest.FeatureSpec | No | Yes |
BaseTest | ZooKeeper powered FlatSpec trait mixin, based on org.scalatest.FlatSpec | Yes | Yes |
FeatureBaseTest | ZooKeeper powered FeatureSpec trait mixin, based on org.scalatest.FeatureSpec | Yes | Yes |
Using the built-in testing utilities is very simple. In most cases, you use one of the first two base implementations, either CassandraFlatSpec or CassandraFeatureSpec, based on what kind of tests you like writing (flat or feature). To get started with phantom tests, the usual steps are as follows:
```scala
import scala.concurrent.{Await, Future}
import scala.concurrent.duration._
import com.websudos.phantom.dsl._

object DatabaseService {
  def init(): Future[List[ResultSet]] = {
    val create = Future.sequence(List(
      Table1.create.future(),
      Table2.create.future()
    ))
    Await.ready(create, 5.seconds)
  }

  def cleanup(): Future[List[ResultSet]] = {
    val truncate = Future.sequence(List(
      Table1.truncate.future(),
      Table2.truncate.future()
    ))
    Await.ready(truncate, 5.seconds)
  }
}
```
```scala
import com.websudos.phantom.testkit._

trait CustomSpec extends CassandraFlatSpec {
  override def beforeAll(): Unit = {
    super.beforeAll()
    DatabaseService.init()
  }

  override def afterAll(): Unit = {
    super.afterAll()
    DatabaseService.cleanup()
  }
}
```
Running your database tests with phantom is now trivial. A great idea is to use asynchronous testing patterns and future sequencers to get the best possible performance even out of your tests. Now all your other test suites that need a running database would look like this:
```scala
import com.websudos.phantom.dsl._
import com.websudos.util.testing._

class UserDatabaseServiceTest extends CustomSpec {

  it should "register a user from a model" in {
    val user = //.. create a user

    // A for-yield de-sugars to a flatMap chain: first write, then fetch by id.
    // The first future only completes when the user has been written,
    // so you have an async sequence guarantee that "getById" runs only after
    // the user is actually available.
    val chain = for {
      store <- UserDatabaseService.register(user)
      get <- UserDatabaseService.getById(user.id)
    } yield get

    // The "successful" method comes from com.websudos.util.testing._ in our util project.
    chain.successful {
      result => {
        // result is now an Option[User]
        result.isDefined shouldEqual true
        result.get shouldEqual user
      }
    }
  }
}
```
If you are using ZooKeeper and you want to run tests through a full ZooKeeper powered cycle, where Cassandra settings are retrieved from a ZooKeeper instance that is either running locally or auto-spawned if none is found, pick one of the last two base suites.
Phantom spares you the trouble of spawning your own Cassandra server during tests. The implementation is based on the [cassandra-unit](https://github.com/jsevellec/cassandra-unit) project. Phantom will automatically pick the right version of Cassandra; however, do be careful, as we often tend to use the latest version to keep up with the latest features.
You may use a brand new phantom feature, see the tests pass with flying colours locally, and then get bad errors in production. The version of Cassandra covered by the latest phantom release and used for embedding is written at the very top of this readme.
Phantom uses the phantom-testkit module to run tests without a local Cassandra server running. There are no prerequisites for running the tests: phantom will automatically load an embedded Cassandra of the right version, run all the tests and do the cleanup afterwards. Read more on the testing utilities to see how you can achieve the same thing in your own database tests.
If a local Cassandra installation is found running on localhost:9042, phantom will attempt to use that instead. Some of the version based logic is found directly inside phantom, although advanced compatibility and protocol version detection is a task we left to our dear partners at Datastax, as we felt re-implementing that concern in Scala would bring no significant added value.
Phantom uses multiple SBT configurations to distinguish between two kinds of tests: normal and performance tests. Performance tests are not run during Travis CI builds; we usually run them manually when serious changes are made to the underlying Twitter Spool and Play iteratee based iterators, events that are very rare indeed.
Phantom was developed at Websudos as an in-house project. All Cassandra integration at Websudos goes through phantom, and nowadays it's safe to say most Scala/Cassandra users in the world rely on phantom.
Special thanks to Viktor Taranenko from WhiskLabs, who gave us the original idea.
Copyright 2013 - 2015 Websudos.
Contributions are most welcome! Use GitHub for issues and pull requests and we will happily help out in any way we can!
To contribute, simply submit a "Pull request" via GitHub.
We use GitFlow as a branching model and SemVer for versioning.
In spirit, we follow the Twitter Scala Style Guidelines. We will reject your pull request if it doesn't meet code standards, but we'll happily give you a hand to get it right.
Some of the things that will make us seriously frown:
We are very grateful to have the open source license support of YourKit, the most advanced Java profiler.
YourKit is the very core of our performance bottleneck testing, and without it phantom would still be a painfully slow tool.