Login

Tags

Factual Blog / Tagged:

open source

How Factual Uses Persistent Storage For Its Real-Time Services

As part of Factual’s Geopulse product suite, we need to be able to absorb and process large amounts of data, and deliver back a somewhat smaller amount of data. There is a significant amount of technology available for the processing stage, but fewer for both the intake and delivery. Today, we’re open sourcing two libraries that...

Open Source Clojure Library for Geospatial Information

At Factual, we process a lot of spatial data. We’re open-sourcing one of our internal libraries for working with geospatial information, especially geohashes. We want all Clojure programmers to be able to answer questions about coordinates, distances, and polygon intersections. We think this library will be particularly useful in concert with our rich suite of location...

New Drake (Make for Data) Feature: Asynchronous Step Execution

A major new feature in the recent Drake 0.1.4 release is allowing the asynchronous execution of steps. Drake can now automatically parallelize the steps in your workflow based on the specified dependencies. If you have steps in your workflow that have high latency, or you have different steps that use different resources (e.g. one step is...

nfu: Command-line Numeric Fu

Note: Explore nfu on Github here We often use the UNIX command line for ad-hoc data crunching. Most of the time we have the good sense to use a better tool after the first 100 characters or so, but sometimes we’ll just blow past the right margin with a string of sort, uniq -c, sort -nr,...

Extend Drake (Make for Data) with a Simple Clojure Project

Earlier this year we released Drake, an open source data workflow tool. It was exciting to see the interest in Drake. We were especially pleased by the quality of outside contributions made to the project (as one example: S3 support – thanks @howech!). But we weren’t happy with the delay on our side in accepting these...