Russell Spitzer's Blog: Russ Blog Home

Scratch Project - Gravity Chomper

03 March 2017

Project Walk-through for the Scratch Class Taught at The Made. A simple character moves around eating floating objects. Gravity can be controlled to either draw or repel nearby objects.

Read Post

Concurrency in Spark

27 February 2017

How does Spark actually execute code and how can I do concurrent work within a Spark execution pipeline? Spark is complicated, Concurrency is complicated, and distributed systems are also complicated. To answer some common questions I’m going to go over some basic details on what Spark is doing and how you can control it.

Read Post

Ordering with saveToCassandra

04 February 2017

Responding to a question on Stack Overflow http://stackoverflow.com/questions/42020173/savetocassandra-is-there-any-ordering-in-which-the-rows-are-written/42031425#42031425

Read Post

Spark Applications are Fat

03 February 2017

Spark Submit is great and you should use it.

Read Post

Distributed Locks are Hard

19 December 2016

I recently was answering a Stack Overflow Question which made me start thinking a bit about locking and some assumptions made in Distributed systems. In this case we had what I find is a pretty common error in distributed systems and particularly with Cassandra.

Read Post

Is the Spark Cassandra Connector Asynchronous / How Spark Works

23 August 2016

Are the Spark Cassandra Connector Api’s Async?

Read Post

Pruning and Spark DataSources, A Love Story

22 July 2016

I was looking through a pile of Jiras and I noticed an interesting complaint that DataFrame pruning was broken for the Spark Cassandra Connector. The ticket noted that even when very specific columns were selected, it seemed like the Connector was pulling all of the rows from the source Cassandra table. This is surprising, since that particular part of the connector code has some rather heavy testing coverage and there haven’t been any comments on this feature not working from anyone else. Compared to predicate pushdown, pruning is easy so what went wrong?

Read Post

Debugging Catalyst and Predicate Pushdown with Spark Cassandra Connector

18 April 2016

Making sure your code is actually pushing down predicates to C* is slightly confusing. In this post we’ll go over the basics on setting up debugging and how to workaround a few common issues.

Read Post

Utilizing Multiple C* Clusters using the Spark Cassandra Connector Part II

16 February 2016

Talking to Multiple Clusters, Now with Spark SQL

Read Post

Sbt Assembly (Fat Jars) With Spark

22 January 2016

Classpaths are almost always the first error folks run into when writing custom applications for Spark. The difficult usually centers around the fact that there are many Spark processes and their special class-loaders. Most folks get around these issue by building Fat Jars with sbt assembly but not everyone needs to do this.

Read Post

Russell Spitzer's Blog

Scratch Project - Gravity Chomper

Concurrency in Spark

Ordering with saveToCassandra

Spark Applications are Fat

Distributed Locks are Hard

Is the Spark Cassandra Connector Asynchronous / How Spark Works

Pruning and Spark DataSources, A Love Story

Debugging Catalyst and Predicate Pushdown with Spark Cassandra Connector

Utilizing Multiple C* Clusters using the Spark Cassandra Connector Part II

Sbt Assembly (Fat Jars) With Spark

About

Recent Posts

Elsewhere