But, I have been convinced for some time now of the much greater potential such performance unleashes in the broader and more complex EDW environment. And the vendors have been fairly quiet about this part of the market so far, maybe preferring to leave such more technically and politically complex projects to the big guys. So, it was good to see Vertica's 4.0 announcement last week beginning to address the EDW market with its emphasis on "enterprise ready" and a number of interesting new features and expansions of old functions.
Robust workload and resource management for mixed workloads is a prerequisite for an EDW. Vertica's introduction of administrator-defined resource pools with memory-usage, priority and concurrency settings and the assignment of users to these pools is a big step in this direction. A rework of the optimizer in support of this and other features suggests that Vertica are serious about this support.
Also introduced in V4.0 is a newly optimized single record lookup on primary keys. While aimed at a particular financial analysis use case, this function shows that the database can do more than just crunch columns. Added to the FlexStore feature introduced in V3.5 where newly loaded data is kept in row format in memory for some period of time, I believe we're seeing the database's growing ability to handle the sort of record-level processing often needed in EDWs. The new time-series support in V4.0 also plays directly in EDW needs.
Time and customer experience will, of course, prove if I'm correct, but it seems to me that Vertica is beginning to test my assertion that columnar, MPP databases can be applied to EDWs. And further that their performance characteristics offer the possibility of re-architecting the EDW / data mart divide.
Posted March 3, 2010 11:17 AM
Permalink | No Comments |



