Thinking about BI: Infographics is the next phase…

I have been thinking about BI… prompted by a friend, Frank Bien, who is the CTO of Looker (you are welcome, Frank, for the plug…) but this post is about a  trend in BI that is worth exploring… and only maybe about Looker or any other tool. BI was originally about reporting… in its very first…

Who is How Columnar? Exadata, Teradata, and HANA – Part 2: Column Processing

In my last post here I suggested that there were three levels of maturity around column orientation and described the first level, PAX, which provides columnar compression. This apparently is the level Exadata operates at with its Hybrid Columnar Compression. In this post we will consider the next two levels of maturity: early materialized column…

Memory Trends and HANA

If the Gartner estimates here are correct… then DRAM prices will fall 50% per year per year over the next several years… and then in 2015 non-volatile RAM (see the related articles below) will become generally available. It has been suggested that memory prices will fall slower than data warehouses will grow (see here). That does not seem to…

Some Unaudited HANA Performance Numbers

The following performance numbers are being reported publicly for HANA: HANA scans data at 3MB/msec/core On a high-end 80-core server this translates to 240GB/sec per node HANA inserts rows at 1.5M records/sec/core Or 120M records/sec per node… Aggregates 12M records/sec/core Or 960M records per node… These numbers seem reasonable: A 100X improvement over disk-based scan…

The Cost of Dollars per Terabyte

Let me be blunt: using price per terabyte as the measure of a data warehouse platform is holding back the entire business intelligence industry. Consider this… The Five Minute Rule (see here and here) clearly describes the economics of HW technology… suggesting exactly when data should be retained in memory versus when it may be moved…

My 2 Cents: Greenplum 1Q2013

Since my blogs tend to be in response to some stimulus they may not reflect a holistic view on any particular product. The “My 2 Cents” series will try to provide a broader view… Please consider this as you read on… Summary From a technical perspective, Greenplum is my favorite data warehouse database. Built on…