In my previous rounds with DataWarehouse oriented engines I used single table without joins, and with small (as for DW) datasize (see http://www.mysqlperformanceblog.com/2009/10/02/analyzing-air-traffic-performance-with-infobright-and-monetdb/, http://www.mysqlperformanceblog.com/2009/10/26/air-traffic-queries-in-luciddb/, http://www.mysqlperformanceblog.com/2009/11/02/air-traffic-queries-in-infinidb-early-alpha/). Addressing these issues, I took Star Schema Benchmark, which is TPC-H modification, and tried run queries against InfoBright, InfiniDB, LucidDB and MonetDB. I did not …
[Read more]Pentaho Solutions
Pentaho Solutions, Business Intelligence and Data Warehousing with Pentaho and MySQL. By Roland Bouman and Jos van Dongen, Wiley 2009. Page count: about 570 pages. (Here’s a link to the publisher’s site.)
The book is big in part because it’s about a GUI tool, so there are the requisite number of screenshots (but not too many). It is structured into four parts, each on a different topic.
The first part is 4 chapters on getting started with Pentaho: from a quick-start through …
[Read more]
This is next post in series
Analyzing air traffic performance with InfoBright
and MonetDB
Air traffic queries in LucidDB
Air traffic queries in InfiniDB: early alpha
Let me explain the reason of choosing these engines. After initial three posts I am often asked "What is baseline ? Can we compare results with standard MySQL engines ?". So there come MyISAM to consider it as base point to see how column-oriented-analytic engines are better here.
However, take into account, that for MyISAM we need to choose proper indexes to execute queries …
[Read more]
As Calpont announced availability of InfiniDB I surely
couldn't miss a chance to compare it with previously tested
databases in the same environment.
See my previous posts on this topic:
Analyzing air traffic performance with InfoBright
and MonetDB
Air traffic queries in LucidDB
I could not run all queries against InfiniDB and I met some hiccups during my experiment, so it was less plain experience than with other databases.
So let's go by the same steps:
Load data
InfiniDB supports MySQL's LOAD DATA
statement and
it's own colxml / cpimport
utilities. As …
Accidentally me and Baron played with InfoBright (see http://www.mysqlperformanceblog.com/2009/09/29/quick-comparison-of-myisam-infobright-and-monetdb/) this week. And following Baron's example I also run the same load against MonetDB. Reading comments to Baron's post I tied to load the same data to LucidDB, but I was not successful in this.
I tried to analyze a bigger dataset and I took public available
data
http://www.transtats.bts.gov/DL_SelectFields.asp?Table_ID=236&DB_Short_Name=On-Time
about USA domestic flights with information about flight length
and delays.
The data is available from 1988 to 2009 in chunks per month, so I downloaded 252 files (for 1988-2008 years) with …
[Read more]
Hi!
Yesterday I had the honour of presenting my
mini-bi/datawarehousing tutorial "Starring Sakila" for MySQL
University. I did a modified version of the presentation I did together with Matt Casters at
the MySQL user's conference 2009. The structure of the
presentation is still largely the same, although I condensed
various bits, and I added practical examples of setting up the
ETL process and creating a Pentaho Analysis View (OLAP pivot
table) on top of a Mondrian Cube.
The slides, session recording, and materials such as SQL script,
pentaho data integration jobs and transformations, and Sakila
Rentals Cube for Mondrian are all available here on MySQL Forge.
Copyright Notice
Presentation slides, and …
Ok folks, just after testing out some new changes to this week’s release - I’ve uploaded a new version. Here are the following changes:
Changelog 1.2a
- changes line thickness to 1 for all graphs, 2 was too thick
for running at 5 minute intervals for the poller
- added several scripts for running and logging monitor agent and
report_generator
- changes talkback sripts to one script instead of 3
When your company decides that "it is time to build a data warehouse", what thoughts come to mind?1) A magical fairy ice cream land where data is presented in chocolate shells for everyone to digest perfectly;2) A big literal warehouse in the industrial section of town with rusty old containers;3) Another place to put data, which means another place for you to track and monitor additional
One of the enhancements I added to MySQL Archiver in the recent release was listed innocently in the changelog as "Destination plugins can now rewrite the INSERT statement." Not very exciting or informative, huh? Keep reading.
In the first two articles in this series, I discussed archiving basics, relationships and dependencies, and specific archiving techniques for online transaction processing (OLTP) database servers. This article covers how to move the data from the OLTP source to the archive destination, what the archive destination might look like, and how to un-archive data. If you can un-archive easily and reliably, a whole new world of possibilities opens up.