The Crayon Blog

Why you don’t need petabytes for a big data opening

Tech Articles | Published April 11, 2014 | Tejeswini Kashyappan

Big data isn’t necessarily big and can be as much about the complexities of processing information as about volumes or data types.

Personal genetic-profiling services such as 23andMe, which charges $99 to sequence an individual’s genome, illustrate the point, according to Forrester principal analyst Mike Gualtieri.

The resulting data from one individual’s sequenced DNA is only about 800MB, he told an audience at last week’s Hadoop Summit in Amsterdam,

“That’s not a lot. Would you call that big data? If I said 800MB is big data, I’d get laughed out of the room,” Gualtieri said.

“But within that, there are four billion pieces of information and lots of patterns. So it’s a big processing challenge, it’s a big compute challenge. You don’t have to have petabytes of data to have a big-data opportunity or issue.”

In fact, big data is a self-defining concept that Gualtieri described as the frontier of an individual company’s ability to store, process and access data to achieve business outcomes — and those outcomes are mostly about understanding and serving customers.

Recent Blogs

October 11, 2024

Categories

The Crayon Blog

Tribute to Mr. Ratan Tata: A Life of Purpose, Vision, and Humanity

September 18, 2024

Categories

The Crayon Blog

The Superlative of Efficiency is Here!

May 23, 2024

Categories

The Crayon Blog

Navigating the Future of Lending: How AI is Revolutionizing Consumer Credit

April 16, 2024

Categories

The Crayon Blog

Is the GenAI out of the bottle?

Subscribe to the Crayon Blog. Get the latest posts in your inbox!

SIGN UP HERE

The Crayon Blog

Why you don’t need petabytes for a big data opening

Tech Articles | Published April 11, 2014 | Tejeswini Kashyappan

Big data isn’t necessarily big and can be as much about the complexities of processing information as about volumes or data types.

Personal genetic-profiling services such as 23andMe, which charges $99 to sequence an individual’s genome, illustrate the point, according to Forrester principal analyst Mike Gualtieri.

The resulting data from one individual’s sequenced DNA is only about 800MB, he told an audience at last week’s Hadoop Summit in Amsterdam,

“That’s not a lot. Would you call that big data? If I said 800MB is big data, I’d get laughed out of the room,” Gualtieri said.

“But within that, there are four billion pieces of information and lots of patterns. So it’s a big processing challenge, it’s a big compute challenge. You don’t have to have petabytes of data to have a big-data opportunity or issue.”

In fact, big data is a self-defining concept that Gualtieri described as the frontier of an individual company’s ability to store, process and access data to achieve business outcomes — and those outcomes are mostly about understanding and serving customers.

Recent Blogs

October 11, 2024

Categories

The Crayon Blog

Tribute to Mr. Ratan Tata: A Life of Purpose, Vision, and Humanity

September 18, 2024

Categories

The Crayon Blog

The Superlative of Efficiency is Here!

May 23, 2024

Categories

The Crayon Blog

Navigating the Future of Lending: How AI is Revolutionizing Consumer Credit

April 16, 2024

Categories

The Crayon Blog

Is the GenAI out of the bottle?

Subscribe to the Crayon Blog. Get the latest posts in your inbox!

SIGN UP HERE

The Crayon Blog

Why you don’t need petabytes for a big data opening

Tech Articles | Published April 11, 2014 | Tejeswini Kashyappan

Big data isn’t necessarily big and can be as much about the complexities of processing information as about volumes or data types.

Personal genetic-profiling services such as 23andMe, which charges $99 to sequence an individual’s genome, illustrate the point, according to Forrester principal analyst Mike Gualtieri.

The resulting data from one individual’s sequenced DNA is only about 800MB, he told an audience at last week’s Hadoop Summit in Amsterdam,

“That’s not a lot. Would you call that big data? If I said 800MB is big data, I’d get laughed out of the room,” Gualtieri said.

“But within that, there are four billion pieces of information and lots of patterns. So it’s a big processing challenge, it’s a big compute challenge. You don’t have to have petabytes of data to have a big-data opportunity or issue.”

In fact, big data is a self-defining concept that Gualtieri described as the frontier of an individual company’s ability to store, process and access data to achieve business outcomes — and those outcomes are mostly about understanding and serving customers.

Recent Blogs

October 11, 2024

Categories

The Crayon Blog

Tribute to Mr. Ratan Tata: A Life of Purpose, Vision, and Humanity

September 18, 2024

Categories

The Crayon Blog

The Superlative of Efficiency is Here!

May 23, 2024

Categories

The Crayon Blog

Navigating the Future of Lending: How AI is Revolutionizing Consumer Credit

April 16, 2024

Categories

The Crayon Blog

Is the GenAI out of the bottle?

Subscribe to the Crayon Blog. Get the latest posts in your inbox!

SIGN UP HERE

The Crayon Blog

Why you don’t need petabytes for a big data opening

Tech Articles | Published April 11, 2014 | Tejeswini Kashyappan

Big data isn’t necessarily big and can be as much about the complexities of processing information as about volumes or data types.

Personal genetic-profiling services such as 23andMe, which charges $99 to sequence an individual’s genome, illustrate the point, according to Forrester principal analyst Mike Gualtieri.

The resulting data from one individual’s sequenced DNA is only about 800MB, he told an audience at last week’s Hadoop Summit in Amsterdam,

“That’s not a lot. Would you call that big data? If I said 800MB is big data, I’d get laughed out of the room,” Gualtieri said.

“But within that, there are four billion pieces of information and lots of patterns. So it’s a big processing challenge, it’s a big compute challenge. You don’t have to have petabytes of data to have a big-data opportunity or issue.”

In fact, big data is a self-defining concept that Gualtieri described as the frontier of an individual company’s ability to store, process and access data to achieve business outcomes — and those outcomes are mostly about understanding and serving customers.

Recent Blogs

October 11, 2024

Categories