Showing entries 121 to 130 of 164
« 10 Newer Entries | 10 Older Entries »
Displaying posts with tag: hadoop (reset)
451 CAOS Links 2011.05.10

EMC launches Greenplum HD. DataStax releases Brisk. And more.

# EMC launched its Greenplum HD Hadoop distribution, with the support of Jaspersoft, Pentaho, and SnapLogic, among others.

# DataStax …

[Read more]
451 CAOS Links 2011.05.03

Novell sold to Attachmate. Barnes & Noble throws the book at Microsoft. And more.

Follow 451 CAOS Links live @caostheory on Twitter and Identi.ca, and daily at Paper.li/caostheory
“Tracking the open source news wires, so you don’t have to.”

# Novell closed its acquisition by Attachmate and its patent sale to CPTN.

# Attachmate’s CEO discussed the company’s plans for SUSE Linux.

# Barnes & Noble …

[Read more]
451 CAOS Links 2011.04.12

Groklaw declares victory. Cloudera updates Hadoop distro. And more.

Follow 451 CAOS Links live @caostheory on Twitter and Identi.ca, and daily at Paper.li/caostheory
“Tracking the open source news wires, so you don’t have to.”

# Groklaw claimed victory, will stop publishing new articles on May 16.

# Cloudera released version 3 of its Hadoop distribution.

# VoltDB released version 1.3 of its open source distributed in-memory database.

# Black Duck grew sales by 51% in Q1.

# eXo and Convertigo …

[Read more]
451 CAOS Links 2011.03.25

Red Hat grows revenue 20%+. Google withholding Honeycomb source code. And more.

Follow 451 CAOS Links live @caostheory on Twitter and Identi.ca, and daily at Paper.li/caostheory
“Tracking the open source news wires, so you don’t have to.”

# Red Hat reported Q4 revenue up 25% to $245m, FY revenue up 22% to $909m

# Google is withholding the source code to Honeycomb for the foreseeable future.

# Rick Clark explained why he left Rackspace amid concerns that the company is exerting too much control over OpenStack.

# DataStax …

[Read more]
Who/What to acquire next

Well as predicted, with Aster Data recently being picked up by Teradata most of the key new generation MPP distributed analytics vendors have been acquired (Aster Data, Vertica, Netezza & Greenplum).  This had to happen and was expected to happen.  The MPP Analytics startup “revolution” is over and these technologies will now be integrated into the mainstream.

So what’s next?  As we now, if you are a massive multi-national software company it is a lot less risky to incrementally innovate and leave the development of “game changing” technologies to startups that can be acquired after they prove both the tech and the market.  So what follows MPP? …

[Read more]
What’s hot in Big Data startups?

There are so, so many big data platforms in play at the moment it can be confusing for developers to know where to start.  For startups it used to be simple, MySQL, but dust clouds were created when all the NoSQL platforms started to crash the party 18 months or so ago.  But I do see the dust begin to settle and we are starting to see some market “leaders” appear.  A very unscientific approach is to list the technologies I hear about in the “big data startup” world on a daily basis.  These are, in no particular order:

  • MySQL - yes it is still very much hanging in there despite the Oracle acquisition.  MySQL has been helped by technologies such as AWS RDS and Xeround making it more digestible for big data startups who want to minimize operational overheads.
[Read more]
Q&A with Stephen Baker of "Final Jeopardy"

IBM's Watson natural language Question & Answer system made headlines recently with its primetime debut on Jeopardy.  Despite a few embarassing answers, Watson trounced top Jeopardy players Brad Rutter and Ken Jennings.  Watson is built from 90 IBM Power 750 IBM Linux servers with 16 terabytes of memory providing 80 Teraflops of processing power.  Watson is perhaps the most famous "Big Data" systems out there.  Watson's knowledge base consists of 200 million pages of text data that is pre-processed using  …

[Read more]
Free Hadoop class in Dallas

Cloudera Instructor Tom Hanlon will be presenting a free class on Hadoop Tuesday March 15th on Dallas. Tom is a familiar face to MySQLers in the North Texas area having previous taught many MySQL classes. Pizza and Drinks will be provided.

This will be an excellent opportunity for MySQL DBAs to learn from booth a MySQL and Hadoop expert. Hadoop is a computational paradigm named Map/Reduce, where the application is divided into many small fragments of work which may be executed any node in the cluster.

Register Here as they may need to shift locations to find the anticipated crowd.

451 CAOS Links 2011.02.01

Hudson developers vote for Jenkins. SugarCRM turns cash flow positive. And more.

Follow 451 CAOS Links live @caostheory on Twitter and Identi.ca, and daily at Paper.li/caostheory
“Tracking the open source news wires, so you don’t have to.”

# The Hudson developer community voted overwhelmingly to rename the project Jenkins, and will continue without Oracle.

# SugarCRM turned cash flow positive in 2010 as billings increased 52% year on year.

# BonitaSoft announced the release of version 5.4 of Bonita Open Solution.

# WANdisco …

[Read more]
Hadoop Cluster Setup on Debian Lenny

Today I will describe the setup of a Hadoop / HDSF multi-node cluster on Debian Lenny with a redundant Namenode using DRBD and Heartbeat, four Datanodes and Tasktracker, a Backup- Checkpointnode and Rack awareness.

Hadoop Cluster Setup on Debian Lenny purposes

This article descibes how to setup a hadoop (version 0.21.0) cluster on debian lenny (version 5.x). I will not describe how to use MapReduce.

general

Hadoop is a framework for distributed computing written in Java. The project includs the following subprojects:

  • HDFS: A distributed file system
  • MapReduce: A framework for distributed large data processing

list of references

[Read more]
Showing entries 121 to 130 of 164
« 10 Newer Entries | 10 Older Entries »