Datafloq

Where Is The Real Value in Big Data?

Pete Ianace / 5 min read.
October 5, 2015

Looking at all the activity around Big Data, one should ask: how much of Big Data is actually useful? Applying just a little common sense, we discover that only a small amount is.

I have been working with data for over 40 years. Going back to pre-internet days, we experienced what we called data overload, and we discovered then that data in itself wasn't valuable: only a small slice of it had a direct impact on actual business decisions. With that history in mind, what has really changed in solving the most critical issue, finding the data that is actually useful? Volume has certainly increased, but what matters most is that much of the growth in volume comes in the form of unstructured data.

So let me start with what unstructured data is, using the definition from Webopedia. The term unstructured data refers to any data that has no identifiable structure. For example, images, videos, email, documents and text are all considered to be unstructured data within a dataset.

While each individual document may contain its own specific structure or formatting, based on the software program used to create it, unstructured data may also be considered loosely structured data: the data sources do have a structure, but not all data within a dataset will share the same structure. This is in contrast to a database, which is a common example of “structured” data.
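To make the distinction concrete, here is a minimal sketch in Python. The records, field names and the `shares_one_schema` helper are all made up for illustration; the only point is that database-style rows share one set of fields, while a mixed collection of emails, images and documents does not.

```python
# Hypothetical sample data: rows as they might come from a database table.
structured_rows = [
    {"id": 1, "name": "Ann", "email": "ann@example.com"},
    {"id": 2, "name": "Bob", "email": "bob@example.com"},
]

# Hypothetical mixed dataset: each item has a structure, but not the same one.
loosely_structured = [
    {"type": "email", "subject": "Q3 report", "body": "Numbers attached."},
    {"type": "image", "width": 800, "height": 600},
    {"type": "document", "title": "Notes", "pages": 3},
]


def shares_one_schema(records):
    """Return True when every record has exactly the same set of fields."""
    schemas = {frozenset(record) for record in records}
    return len(schemas) <= 1


print(shares_one_schema(structured_rows))
print(shares_one_schema(loosely_structured))
```

The first call reports a single shared schema and the second does not, which is the "loosely structured" situation described above.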

So, looking back in history, we are talking about data overload with a new twist called unstructured data, which represents much of the new volume being generated. I would suggest that companies that combine strong data analytics expertise with a good grasp of both industry standards and compliance rules can offer precise filtering solutions that identify the most valuable data for the user.



Peeling Back the Onion a Bit More

Numerous solutions are emerging that address the filtering and analytics of structured data, such as Splunk Enterprise, which collects, indexes and harnesses all of the fast-moving machine data generated by applications, servers and devices, whether physical, virtual or in the cloud. As for what Hadoop brings to the table, many others have debated its pluses and minuses, and I will leave that topic to them.

My view is that the real challenge is to provide cost-effective solutions that address the much more complex world of filtering and real-time analytics of unstructured data. The volume of all data types is expected to grow 800% in the next five years, and 80% of that growth will be unstructured data.
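To see what those two percentages imply, here is a small worked calculation in Python. It is only a sketch: it assumes "grow 800%" means the added volume is eight times today's volume, and the starting figure of 100 units is an arbitrary placeholder, not a measured number.

```python
def project_growth(current_volume, growth_pct=800, unstructured_share=0.80):
    """Split projected growth into unstructured and structured portions.

    Assumption: growth_pct is the increase relative to current_volume,
    so 800% growth adds 8x the current volume on top of what exists today.
    """
    growth = current_volume * growth_pct / 100
    unstructured_growth = growth * unstructured_share
    structured_growth = growth - unstructured_growth
    return {
        "total_after": current_volume + growth,
        "unstructured_growth": unstructured_growth,
        "structured_growth": structured_growth,
    }


# Arbitrary starting point of 100 units of data.
projection = project_growth(100)
print(projection)
```

Under these assumptions, a starting volume of 100 units gains roughly 800 units, of which about 640 are unstructured and about 160 are structured.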

I would suggest that companies that possess skills and capabilities that include data modeling, analytics, OCL, and ontology have a leg up when it comes to delivering solutions that leverage both structured and unstructured data. As of today the jury is still out on who will be the players that will offer compelling solutions that address the holy grail of finding the needle in the haystack in the growing world of Big Data.

Big Data, What Role Does Ontology Play?

Ontology

An ontology formally represents knowledge as a hierarchy of concepts within a domain, using a shared vocabulary to denote the types, properties and interrelationships of those concepts.

Ontologies are the structural frameworks for organizing information and are used in artificial intelligence, the Semantic Web, systems engineering, software engineering, biomedical informatics, library science, enterprise bookmarking, and information architecture as a form of knowledge representation about the world or some part of it. The creation of domain ontologies is also fundamental to the definition and use of an enterprise architecture framework.



Why is it Important?

An ontology eliminates the need to integrate systems and applications when looking for critical data or trends.

How is it applied and what are the important elements that make it all work?

Ontology uses a unique combination of an inherently agile, graph-based semantic model and semantic search to reduce the timescale and cost of complex data integration challenges. Ontology is rethinking data acquisition, data correlation and data migration projects in a post-Google world.

Why would someone want to develop an ontology?

  • Sharing common understanding of the structure of information among people or software agents is one of the more common goals in developing ontologies. For example, suppose several different Web sites contain medical information or provide medical e-commerce services. If these Web sites share and publish the same underlying ontology of the terms they all use, then computer agents can extract and aggregate information from these different sites. The agents can use this aggregated information to answer user queries or as input data to other applications.
  • Making explicit domain assumptions underlying an implementation makes it possible to change these assumptions easily if our knowledge about the domain changes. Hard-coding assumptions about the world in programming-language code makes these assumptions not only hard to find and understand but also hard to change, in particular for someone without programming expertise. In addition, explicit specifications of domain knowledge are useful for new users who must learn what terms in the domain mean.

Often an ontology of the domain is not a goal in itself. Developing an ontology is akin to defining a set of data and their structure for other programs to use. Problem-solving methods, domain-independent applications, and software agents use ontologies and knowledge bases built from ontologies as data.
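As a rough illustration of the medical web site example, the sketch below shows a software agent aggregating records from two sites that publish data using the same agreed terms. Everything here is hypothetical: the term names in `SHARED_TERMS`, the two sites, their records and prices are invented for the example, and a real ontology would carry far more than a flat set of field names.

```python
# Hypothetical shared vocabulary that both sites agree to publish against.
SHARED_TERMS = {"drug_name", "condition", "price_usd"}

# Invented records from two independent sites using the shared terms.
site_a = [
    {"drug_name": "ibuprofen", "condition": "pain", "price_usd": 4.99},
]
site_b = [
    {"drug_name": "ibuprofen", "condition": "fever", "price_usd": 5.49},
    {"drug_name": "cetirizine", "condition": "allergy", "price_usd": 7.25},
]


def aggregate(*sites):
    """Merge records from all sites, keeping only those that use the shared terms."""
    merged = []
    for site in sites:
        for record in site:
            if set(record) == SHARED_TERMS:
                merged.append(record)
    return merged


def cheapest(records, drug_name):
    """Answer a simple user query over the aggregated records."""
    offers = [r for r in records if r["drug_name"] == drug_name]
    return min(offers, key=lambda r: r["price_usd"]) if offers else None


catalog = aggregate(site_a, site_b)
print(cheapest(catalog, "ibuprofen"))
```

Because both sites use the same terms, the agent needs no site-specific mapping code before it can answer a query across them.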

What is the Difference between a Taxonomy and an Ontology?

In the world of information management, two common terms are “taxonomy” and “ontology”, but people often wonder what the difference between the two is.

On the technical side, ontologies imply a broader scope of information. People often refer to a taxonomy as a tree; extending that analogy, I'd say that an ontology is often more of a forest. An ontology might encompass a number of taxonomies, with each taxonomy organizing a subject in a particular way.

A taxonomy is generally limited to a specific subject area, for example Products or Medical Conditions. Taxonomies are valuable when you want to add structure and context to unstructured information to make that information more easily searchable. For example, if a taxonomy is used to tag documents in a search index, then when a user does a keyword search of this content, the taxonomy can be presented on the left-hand side of the search results as filter options for the end user. Multiple taxonomies can be combined as filters to make for a powerful drill-down search experience. This is what you see on many leading ecommerce sites like Amazon or Costco.
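The drill-down behavior described above can be sketched in a few lines of Python. This is a toy model, not how any particular search engine or ecommerce site works: the documents, the two taxonomies ("category" and "brand") and the helper names are all invented for illustration.

```python
from collections import Counter

# Hypothetical documents, each tagged with terms from two taxonomies.
DOCS = [
    {"title": "Oak dining chair", "tags": {"category": "Furniture", "brand": "Acme"}},
    {"title": "Folding camp chair", "tags": {"category": "Outdoor", "brand": "Acme"}},
    {"title": "Office chair mat", "tags": {"category": "Furniture", "brand": "Zenith"}},
    {"title": "Oak bookshelf", "tags": {"category": "Furniture", "brand": "Zenith"}},
]


def search(docs, keyword, filters=None):
    """Keyword match on the title, narrowed by any selected taxonomy filters."""
    filters = filters or {}
    hits = []
    for doc in docs:
        if keyword.lower() not in doc["title"].lower():
            continue
        if all(doc["tags"].get(tax) == term for tax, term in filters.items()):
            hits.append(doc)
    return hits


def facet_counts(hits):
    """Count taxonomy terms among the hits, as a filter sidebar would show them."""
    counts = {}
    for doc in hits:
        for taxonomy, term in doc["tags"].items():
            counts.setdefault(taxonomy, Counter())[term] += 1
    return counts


hits = search(DOCS, "chair")
print(facet_counts(hits))
print(search(DOCS, "chair", {"category": "Furniture", "brand": "Acme"}))
```

The first call produces the filter options for a keyword search, and the second combines two taxonomies to drill down to a single result.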

Ontologies can be thought of more like a web, with many different types of relationships between all concepts. Ontologies can have an unlimited number of relationships between concepts, and it is easier to create relationships between concepts across different subject domains. For example, you could create a relationship between the topic of “Wood” in a materials taxonomy and “Chair” in a products taxonomy. Relationship types could be “Example of”, “Purpose of” or “Part of”.
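A minimal way to picture this web of relationships is a small graph with typed edges. The `Ontology` class below is a hypothetical sketch written for this article, not a standard library or a real ontology toolkit, and the "Oak" concept is added only to show a second relationship type.

```python
from collections import defaultdict


class Ontology:
    """Toy ontology: concepts grouped into taxonomies, linked by typed relations."""

    def __init__(self):
        self.taxonomy_of = {}            # concept -> name of its taxonomy
        self._edges = defaultdict(set)   # subject -> {(relation, object)}

    def add_concept(self, concept, taxonomy):
        self.taxonomy_of[concept] = taxonomy

    def relate(self, subject, relation, obj):
        """Add a typed relationship; both concepts must already be known."""
        for concept in (subject, obj):
            if concept not in self.taxonomy_of:
                raise KeyError(f"unknown concept: {concept}")
        self._edges[subject].add((relation, obj))

    def related(self, subject, relation=None):
        """Return sorted (relation, object) pairs, optionally for one relation type."""
        return sorted(
            (rel, obj)
            for rel, obj in self._edges.get(subject, ())
            if relation is None or rel == relation
        )

    def crosses_taxonomies(self, subject, obj):
        """True when the two concepts belong to different taxonomies."""
        return self.taxonomy_of[subject] != self.taxonomy_of[obj]


onto = Ontology()
onto.add_concept("Wood", "materials")
onto.add_concept("Oak", "materials")
onto.add_concept("Chair", "products")
onto.relate("Oak", "example of", "Wood")
onto.relate("Wood", "part of", "Chair")

print(onto.related("Wood"))
print(onto.crosses_taxonomies("Wood", "Chair"))
```

The "Wood" to "Chair" link spans the materials and products taxonomies, which is exactly the kind of cross-domain relationship a single taxonomy tree cannot express.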

Ontologies are used when you want to create a more sophisticated information model, one that might be deployed for advanced natural language processing or text analytics. Ontologies allow you to better understand things like cause and effect between two concepts within a corpus of information. Ontologies can also power question answering engines: for example, if I search for “Who was the 16th president?” an engine leveraging ontologies could return a specific result of “Abraham Lincoln”.
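The question answering idea can be sketched as a lookup over stored facts. This is a deliberately tiny illustration: the `FACTS` list, the `held_office` relation name and the single hand-written question pattern are assumptions made for the example, and a real engine would need genuine language understanding and a much richer knowledge base.

```python
import re

# Hypothetical fact store: (subject, relation, object) triples.
FACTS = [
    ("George Washington", "held_office", ("president", 1)),
    ("Abraham Lincoln", "held_office", ("president", 16)),
]

# One hand-written pattern for questions like "Who was the 16th president?"
QUESTION = re.compile(r"who was the (\d+)(?:st|nd|rd|th) (\w+)\??", re.IGNORECASE)


def answer(question):
    """Return the subject that held the given office at the given ordinal, if known."""
    match = QUESTION.fullmatch(question.strip())
    if not match:
        return None
    ordinal, office = int(match.group(1)), match.group(2).lower()
    for subject, relation, obj in FACTS:
        if relation == "held_office" and obj == (office, ordinal):
            return subject
    return None


print(answer("Who was the 16th president?"))
```

The engine returns a specific answer rather than a list of documents because the relationship between the person and the office is stored explicitly.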

Ontology in its simplest terms:

  • What is the data?
  • What does it mean?
  • Where is it from?
  • Why do we need it?

Once we know that, the real data we need is at hand.

Categories: Big Data
Tags: Big Data, databases, ontology, Taxonomy

About Pete Ianace

Visionary leader bringing more than 35 years of experience building successful technology business units, sales channels and companies. Have extensive experience with business startups and turnarounds, having successfully built and spun out four technology companies in the last fifteen years. Have broad experience as a CEO including heading companies in aerospace, defense contracting, telecommunications, Web 2.0 and IP video communications. Have secured funding of more than $125M for various start up companies and secured large contracts with US, European and Asian clients. During the first 20 years of my career, served in a variety of senior management positions including president of Pactel Meridian Systems, a joint venture between Nortel and Pactel.
