Reinforcement learning with a deep neural network has been applied to a supply chain logistics problem: how to optimize pickup and delivery schedules in a stochastic environment.
Trying to modernize monolithic legacy applications is hard: these applications are core drivers of the business and the risk of messing them up is too great. However, as time goes on, the cost of maintaining these monoliths grows.
A case study presented at Graph Day, recounting an evaluation we did for a client to see whether their database could be reorganized to offer improved query performance. We looked at graph databases (OrientDB, Titan, Neo4J) because they thought of their data as graph data, and a relational database (PostgreSQL) because that's what they were already running.
We are in an era of unprecedented innovation in databases. Data-intensive companies are grappling with whether the many new options — NoSQL, Key-Value, Document, Column Family, Column-Oriented — are appropriate for them. The commercial success of Facebook and LinkedIn makes graph databases a hot area of investigation. Unlike many new databases, they are not a variation on or a simplification of relational databases. Instead they require new ways of thinking and modeling data. In return they can answer truly novel questions.
We are meeting more people who are interested in looking into the world of graph databases. Palladium has executed proofs of concept for clients to help them explore this world. In this post we summarize what sorts of questions we feel a proof-of-concept project can answer, and how we typically tackle them. For our presentation at Graph Day, we'll walk through one in particular, but there are a variety of questions you may want answered.
As part of the work we're doing to refresh our graph database evaluation for a couple of clients (and our upcoming talk at Graph Day!), we took Titan 1.0 out for a spin last week. We'll be doing more in-depth explorations on some in-house and public datasets over the next few weeks, but here are some preliminary impressions based on a comparison with the Titan we came to know a year or so ago.
Our client's legacy system held graph-like data in a relational database, but new customers' data sizes were crippling performance and scale. As part of an overall architectural rejuvenation, we evaluated migrating their data to graph and relational schemas to determine if query performance and scalability could be improved. With representative data in hand, we designed alternate relational schemas, graph database designs, and triple store designs, benchmarking performance and noting subjective measures such as ease of use and fluency of the query language. Vendors included PostgreSQL, Neo4J, Titan, and AllegroGraph. Follow-up studies included other vendors. The results surprised us, leading to a hybrid relational and graph recommendation. We have implemented the first milestone over the last year. Follow-up work shows that graph DB vendors have come a long way even in that time. The methodology and information in this case study should be useful to teams choosing a database engine, whether graph or relational, for their next project.
This post describes how to debug some library dependency issues on a Linux machine. I built a nightly version of Julia (a language for technical computing that we're pretty excited about here) on Linux, deployed it to a different machine, but then it failed to launch, complaining about…
What if, no matter how you try to simplify, your aggregate root is pretty darn big? Writing application services to handle these large entities is a challenge. We run into this all the time with scientific computing.
Fascinating though it is, I’m happy to observe prison life from the outside through shows like Oz or Orange is the New Black. It’s the strange way prison mirrors the outside world that’s so compelling. They have police (gangs) and wars (gangs) and commerce (smuggling) and currency (cigarettes, stamps, etc.) just the same as the free world.
It’s hard to hire good developers. We face the same struggles everyone does in sorting the good from the bad. One of my favorite tropes is the resume as failed inductive proof.
In part 1 of this series, we looked at how an IOC container helped us separate the construction of a ZookeeperClient from its use in service handlers. In this one, we look at how the IOC container can transparently help us manage a singleton that leaks memory.
There is still a substantial gap between this result and the result we’ll find with other environments, and my guess is this is a code generation issue, i.e. instruction selection and scheduling, but I’m not an expert in this area either!
I’ve got 405MB of 3D seismic data from Teapot Dome sitting in my file cache, and I want to give you a quick view of some of its summary statistics. How long do you think you should have to wait? If you’re working in Excel, you might be happy with a few minutes. A .NET programmer — used to endless database calls and virtual machines in his line of work — wouldn’t be too surprised at a few seconds, or tens of seconds. Long enough to fire up a spinny cursor and send you to Facebook, or whatever your work-day sin is.
In a previous post, we talked about untangling multiple UI controls so that they could be developed independently, but react to user interaction in a synchronized manner. Let's posit for a moment that updates to a line on one map control should cause re-rendering of a cross-section control associated with that line, but that the two controls are in different browsers, or even on different machines.
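As a sketch of the decoupling involved, here is a minimal in-process publish/subscribe bus in Python. The names (EventBus, the "line-changed" topic, the controls) are hypothetical, and a real deployment would put a network transport such as WebSockets behind publish() so the controls could live in different browsers or on different machines.

```python
# Minimal in-process sketch of the publish/subscribe idea: the map control
# publishes "line-changed" events, and the cross-section control re-renders
# in response, without either control knowing about the other.
from collections import defaultdict

class EventBus:
    def __init__(self):
        self._subscribers = defaultdict(list)

    def subscribe(self, topic, handler):
        self._subscribers[topic].append(handler)

    def publish(self, topic, payload):
        for handler in self._subscribers[topic]:
            handler(payload)

bus = EventBus()

# The cross-section control only knows about the topic, not about the map.
bus.subscribe("line-changed",
              lambda line: print(f"re-rendering cross-section for {line}"))

# The map control publishes when the user drags the line.
bus.publish("line-changed", {"x0": 0, "y0": 0, "x1": 100, "y1": 50})
```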
ZooKeeper's "native" client APIs are C and Java. If you're programming in .NET (or Python, or a few other languages), the docs helpfully point out that some friendlies have programmed clients that "might" work for you. "Might" is frustrating, as is the possibility that the libraries lag behind. So we used the Java version anyway, and made it a little more idiomatic .NET. It turns out to be a nice look at how to use Java from .NET, and how to implement Task and IObservable patterns by hand.
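The post itself works in C#, but the shape of the "Task pattern by hand" translates. Here is a hedged Python sketch in which a hypothetical callback-style client (standing in for the Java-flavored ZooKeeper API) is adapted so callers get a Future they can block on or compose.

```python
# Wrap a callback-based client so callers get a Future instead of
# registering callbacks themselves. CallbackClient is a hypothetical
# stand-in for a Java-style asynchronous client.
from concurrent.futures import Future

class CallbackClient:
    """Hypothetical callback-style API, like the Java ZooKeeper client's."""
    def get_data_async(self, path, on_done):
        on_done(b"payload-for-" + path.encode())  # would normally fire later

def get_data(client, path):
    """Adapt the callback API to a Future the caller can await or block on."""
    future = Future()
    client.get_data_async(path, future.set_result)
    return future

result = get_data(CallbackClient(), "/services/demo").result(timeout=5)
print(result)
```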
Okay, so you're sold on using ZooKeeper as your service locator or configuration repository. Your services will all talk to ZooKeeper when they start up to find out who they are, who their neighbors are, and generally how to get on with all the other animals at the zoo. But what service locator do you use to find ZooKeeper itself? (ZooKeeper is actually in one or more places, since it typically runs on multiple servers in a production environment.) The answer probably depends on the scope of your problem: a 10,000-node cluster is different from a few dozen services. Your best options are drawn from the service locator patterns already built into your OS or environment. Here we'll talk about three options.
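One of those options is usually plain DNS: publish the ensemble under a well-known name and resolve it at startup. A minimal Python sketch, assuming a hypothetical hostname; since ZooKeeper clients accept a comma-separated host:port list, we join every resolved address into one connection string.

```python
# Resolve a well-known DNS name to build a ZooKeeper connection string.
# The hostname here is hypothetical; substitute your own.
import socket

addresses = {
    info[4][0]
    for info in socket.getaddrinfo("zookeeper.internal.example.com", 2181,
                                   proto=socket.IPPROTO_TCP)
}
connection_string = ",".join(f"{addr}:2181" for addr in sorted(addresses))
print(connection_string)  # e.g. "10.0.0.11:2181,10.0.0.12:2181,10.0.0.13:2181"
```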
In the last post, we used ZooKeeper as a service registry. When services started, they registered with ZooKeeper at a pre-agreed place (/services/{dataset-name}). Clients could list the data servers available and decide which ones to connect to, or request that new ones be launched. Thanks to ephemeral nodes, servers that crash have their registry entries deleted automatically. Today we'll talk about three use cases for watching changes in ZooKeeper.
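The series itself is .NET-flavored, but for a runnable sketch here is the same registry-plus-watch pattern using kazoo, a Python ZooKeeper client. The path and host are illustrative.

```python
# A server registers an ephemeral node under a pre-agreed path, and a
# client watches the children of that path so it is called back whenever
# a server joins or crashes.
from kazoo.client import KazooClient

zk = KazooClient(hosts="127.0.0.1:2181")  # assumed local test ensemble
zk.start()

# Server side: the ephemeral node disappears automatically if we crash.
zk.ensure_path("/services/teapot-dome")
zk.create("/services/teapot-dome/server-", b"host1:9000",
          ephemeral=True, sequence=True)

# Client side: re-invoked every time the set of servers changes.
@zk.ChildrenWatch("/services/teapot-dome")
def on_servers_changed(children):
    print("servers now available:", children)
```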
ZooKeeper is a distributed database originally developed as part of the Hadoop project. It's spawned several imitators: Consul, etcd, and Doozerd (itself a clone of Chubby). A lot of the material out there about ZooKeeper describes just how it works, not necessarily what you'd use it for. In this series of posts, we'll cover how we used it at one client, and how it also got abused.
It starts innocently enough. You need a database connection string and to know which tables are safe to cache, and there’s just no sense in putting that in your source code. Right? I mean, why put hard-coded stuff in your programming language?
Most development is feature driven. A developer is on the line to complete a user story or functional requirement, and even if the application gets a little slower, she'd rather have a demo to show during sprint review than watch everyone else's demos.
Every distributed system eventually requires messages to be written to the wire and transmitted from one machine to another. In many cases these messages are hidden magic: with WCF web services or Thrift RPC, code-generated proxies make remote calls look like local function calls.
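To make "written to the wire" concrete, here is a tiny hand-rolled frame in Python: a length prefix plus payload, roughly the kind of detail those proxies hide. The layout is illustrative, not any particular protocol's format.

```python
# A hand-rolled wire frame: 4-byte big-endian length prefix, then payload.
import struct

def encode(message: bytes) -> bytes:
    return struct.pack(">I", len(message)) + message

def decode(frame: bytes) -> bytes:
    (length,) = struct.unpack(">I", frame[:4])
    return frame[4:4 + length]

frame = encode(b'{"method": "GetTraces", "survey": "teapot-dome"}')
print(decode(frame))
```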
How many times has a customer come and told you your product was “slow”? In this multi-part series, we will discuss how “slow” happens, and how you can fix it.
We were working with a potential client a few weeks ago, trying to figure out if we could help them improve some seismic processing software. The software had excellent science under the covers, but the visual interface was old and tired. Could Palladium help rejuvenate their user experience? Old-looking software can imply old or out-of-date capabilities. Could we make it, well, better?
Unit of measure conversions are a constant concern in scientific code. Most well-written scientific domain kernels should be unit-unaware, because the equations of nature are generally unit invariant: momentum is mass times velocity whether velocity is in meters per second or furlongs per fortnight. But there are always important places where the actual values matter: water boils at 100 degrees Celsius. Therefore one typically assumes a set of canonical units in the computational domain to make the programming more straightforward. It's also more efficient and numerically stable to translate units only at the boundaries of the computational domain, rather than littering conversions throughout.
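A minimal Python sketch of that boundary discipline, with illustrative conversion factors: values are normalized to canonical SI units exactly once on the way in, and the kernel itself never sees a unit.

```python
# Boundary-only unit conversion: normalize inputs to canonical SI units,
# then run a unit-unaware kernel. Factors and kernel are illustrative.
TO_METERS_PER_SECOND = {
    "m/s": 1.0,
    "ft/s": 0.3048,
    "furlongs/fortnight": 201.168 / 1209600.0,
}

def momentum(mass_kg, velocity, velocity_unit):
    # Boundary: convert to canonical units (kg, m/s) exactly once.
    v = velocity * TO_METERS_PER_SECOND[velocity_unit]
    # Kernel: p = m * v holds in any consistent unit system.
    return mass_kg * v  # kg·m/s

print(momentum(10.0, 5.0, "ft/s"))  # 15.24 kg·m/s
```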
Today I had the lovely experience of being told “the network to the cluster is down” while I was writing some code that was supposed to use the cluster. Was I stalled? How could I test my logic? It turns out we’re rather obsessive about separating interface from implementation, usually via C# interface definitions. In this case, I just went down the road I was going down anyway: making some simple mock objects to model the cluster dependencies. (We use Moq.) Now I don’t really care that the network is down.
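The post does this with C# interfaces and Moq; the same move with Python's unittest.mock looks like the following sketch, where the cluster object is a hypothetical stand-in for our cluster interface.

```python
# Mock out the cluster dependency so the logic under test runs with no
# network at all. The cluster interface and run_analysis are hypothetical.
from unittest.mock import Mock

cluster = Mock()
cluster.submit_job.return_value = "job-42"   # canned answer, no network

def run_analysis(cluster, dataset):
    """Code under test: depends only on the cluster interface."""
    job_id = cluster.submit_job(dataset)
    return f"submitted {dataset} as {job_id}"

assert run_analysis(cluster, "teapot-dome") == "submitted teapot-dome as job-42"
cluster.submit_job.assert_called_once_with("teapot-dome")
```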
In the kind of programming we do — scientific simulations and decision support — modeling is usually the first task, and often the hardest. Structuring your problem the right way can make all the difference in determining whether future code is graceful or spaghetti-like.
Back in the 1990s, if you wanted to interview a C++ programmer, you'd ask him to write a string class. My programming homework counted words or wrote versions of grep(1). Perl made one form of regular expressions popular with the masses. That's what you cut your teeth on. Now almost every language you learn has a Unicode-compliant standard string library that most people don't think much about anymore. For most of us, it's a solved problem. (Though there are fun exceptions.)