• Skip to main content
  • Skip to secondary menu
  • Skip to primary sidebar
  • Skip to footer
  • Articles
  • News
  • Events
  • Advertize
  • Jobs
  • Courses
  • Contact
  • (0)
  • LoginRegister
    • Facebook
    • LinkedIn
    • RSS
      Articles
      News
      Events
      Job Posts
    • Twitter
Datafloq

Datafloq

Data and Technology Insights

  • Categories
    • Big Data
    • Blockchain
    • Cloud
    • Internet Of Things
    • Metaverse
    • Robotics
    • Cybersecurity
    • Startups
    • Strategy
    • Technical
  • Big Data
  • Blockchain
  • Cloud
  • Metaverse
  • Internet Of Things
  • Robotics
  • Cybersecurity
  • Startups
  • Strategy
  • Technical

Data is Immortal, but Not Immune to Decay

Martin Doyle / 4 min read.
June 3, 2015
Datafloq AI Score
×

Datafloq AI Score: 83.33

Datafloq enables anyone to contribute articles, but we value high-quality content. This means that we do not accept SEO link building content, spammy articles, clickbait, articles written by bots and especially not misinformation. Therefore, we have developed an AI, built using multiple built open-source and proprietary tools to instantly define whether an article is written by a human or a bot and determine the level of bias, objectivity, whether it is fact-based or not, sentiment and overall quality.

Articles published on Datafloq need to have a minimum AI score of 60% and we provide this graph to give more detailed information on how we rate this article. Please note that this is a work in progress and if you have any suggestions, feel free to contact us.

floq.to/vhqpV

Data exists in a dangerous state of near-non existence. Few businesses would risk not having backups in place. With cloud computing becoming commonplace in enterprise, weve come to accept that our data will be replicated and stored in duplicate.

Even data that is intentionally deleted can often be recovered. When Yahoo! purchased Geocities, nobody dreamed that it would go ahead and delete the entire archive more than 600 gigabytes of internet history. Despite this, enthusiasts were able to quickly archive the collective work of 35 avid Geocities webmasters an important milestone in our ability to breathe new life into data that someone else does not want.

The Cost of Deletion

Deleting data is not just a catastrophe for the user, or the business, or the system itself. Deletion of data also has a cost attached. Weve all deleted files, essays, reports or emails by accident, and weve been forced to spend hours recreating what we lost.  Other consequences also make data loss costly: loss of custom, loss of reputation, or damage to a brand.

Often, it is easier to harvest massive amounts of data, and create massive backups, than to be selective and economical. This means we have huge data silos just waiting to be used. Thats if theyre useful at all.

The Problem with Immortality

If data is immortal, surely we can just save it, back it up and move on? Why worry about data if it can take such good care of itself? Surely all we need to do is back it up on a regular basis?

Unfortunately, its not that simple. Data may be able to replicate itself and survive certain catastrophes, and we might be capable of creating lots of copies for a relatively cheap price.

But data cannot check its own validity or keep itself error-free. And the longer you keep a piece of data, the less useful it becomes.

The rate of data decay is estimated at about 2 percent per month. That doesnt mean a lot on paper. In real terms, industry experts suggest that almost a quarter of contacts in a regular CRM will be out of date in a year.


Interested in what the future will bring? Download our 2023 Technology Trends eBook for free.

Consent

Our own figures suggest that 42 per cent of failed CRM projects came off the rails because of the state of the data. Its not that businesses dont use the systems, or fail to adopt them enthusiastically. They just expect the data to live on, untouched, without any further maintenance.

Survival of the Fittest

Just because data is safe, and immortal, and secure, that does not necessarily mean that it is worth preserving. The worlds data centres are packed with old data that could be inaccessible, corrupted, duplicated or out of date or all of the above.

Clean data does not occur by accident. We get clean data because we invest time in making it so. That means:

  • Deduplicating records in a database so were sure we arent archiving more than one copy of the same thing
  • Checking records against other databases that have already been cleansed (for example, checking addresses against an official Royal Mail postcode database)
  • Ensuring old data is purged so that it does not live forever and sully the database

In order to purify the database, some kind of management or intervention is required.

Good From Bad

There are three main ways to obtain clean data from a dirty database.

  1. You assign a group of staff to contact every person in the database and check their records. Unless the business is a startup, or a niche organisation, this is very unlikely to be a practical or affordable course of action
  2. You use data quality software to automatically check for matches and mistakes, using sophisticated algorithms that can check millions of lines in minutes
  3. The whole dataset can be cleansed to a third party; when data purification is outsourced to a data bureau, the business can get on with its day to day work

Whichever route you chose, you must do something about your data. Simply saving it, copying it and archiving it is not going to ensure its quality and longevity.

Keeping Data Alive

Data storage technology has come on leaps and bounds in the last five years. From stone tablets and vellum, weve transformed the way we record the things we think, say and do. We can store hundreds of books on a tiny memory card, smaller than a postage stamp. And archives have saved the early days of the web from obliteration, ensuring that our first attempts at web design MIDI files and all are given their rightful place in history.

Its possible to manually undelete data, or delete it and start again. If data goes bad, we can trash the whole lot and purchase a list from a third party. But for the best ROI, the best conversions and the best relationships with our customers, we should aim to keep our data clean, and ensure its relevant and useful for years to come.

Categories: Big Data
Tags: Big Data, data quality, data silos, ROI, storage, technology

About Martin Doyle

Armed with qualifications in mechanical engineering, business and finance, and experience of running engineering and CRM businesses, Martin founded a successful CRM (Customer Relationship Management) software house in 1992, supplying systems to large, medium and small sized companies. Developing a deep understanding of the value of data, he became concerned that many organisations were making decisions based on poor quality data. To fill this gap in the market, he sold the CRM company and started DQ Global in 2002 to provide data quality solutions, with a mission to detect, correct and prevent data defects which undermine business decisions. Since then, DQ Global has become a global market leader, delivering enterprise-wide data solutions utilising leading edge technology. Martin has gained a wealth of knowledge and experience and has established himself as a Data Quality Improvement Evangelist and an industry expert.

Primary Sidebar

E-mail Newsletter

Sign up to receive email updates daily and to hear what's going on with us!

Publish
AN Article
Submit
a press release
List
AN Event
Create
A Job Post

Related Articles

How BlaBlaCar Built a Practical Data Mesh to Support Self-Service Analytics at Scale

March 23, 2023 By Barr Moses

A Beginner’s Guide to Reverse ETL: Concept and Use Cases

March 22, 2023 By Tehreem Naeem

How Data Analytics is Revolutionizing Talent Acquisition Leadership

March 20, 2023 By Monika Sangwan

Related Jobs

  • Software Engineer | South Yorkshire, GB - February 07, 2023
  • Software Engineer with C# .net Investment House | London, GB - February 07, 2023
  • Senior Java Developer | London, GB - February 07, 2023
  • Software Engineer – Growing Digital Media Company | London, GB - February 07, 2023
  • LBG Returners – Senior Data Analyst | Chester Moor, GB - February 07, 2023
More Jobs

Tags

AI Amazon analysis analytics app application Artificial Intelligence BI Big Data blockchain business China Cloud Companies company costs crypto Data development digital environment experience finance financial future Google+ government information machine learning market mobile Musk news public research security share skills social social media software startup strategy technology twitter

Related Events

  • 6th Middle East Banking AI & Analytics Summit 2023 | Riyadh, Saudi Arabia - May 10, 2023
  • Data Science Salon NYC: AI & Machine Learning in Finance & Technology | The Theater Center - December 7, 2022
  • Big Data LDN 2023 | Olympia London - September 20, 2023
More events

Related Online Courses

  • American History Through Baseball
  • Digital Marketing World Forum Global 2023
  • Webinar – How to harness financial data to help drive improved analytics and insights with Envestnet & AWS
More courses

Footer


Datafloq is the one-stop source for big data, blockchain and artificial intelligence. We offer information, insights and opportunities to drive innovation with emerging technologies.

  • Facebook
  • LinkedIn
  • RSS
  • Twitter

Recent

  • Microsoft Power BI -The Future of Healthcare’s Most Important Breakthrough
  • The Big Crunch of 2025: Is Your Data Safe from Quantum Computing?
  • From Data to Reality: Leveraging the Metaverse for Business Growth
  • How BlaBlaCar Built a Practical Data Mesh to Support Self-Service Analytics at Scale
  • How Blockchain Technology Can Enhance Fintech dApp Development

Search

Tags

AI Amazon analysis analytics app application Artificial Intelligence BI Big Data blockchain business China Cloud Companies company costs crypto Data development digital environment experience finance financial future Google+ government information machine learning market mobile Musk news public research security share skills social social media software startup strategy technology twitter

Copyright © 2023 Datafloq
HTML Sitemap| Privacy| Terms| Cookies

  • Facebook
  • Twitter
  • LinkedIn
  • WhatsApp

In order to optimize the website and to continuously improve Datafloq, we use cookies. For more information click here.

settings

Dear visitor,
Thank you for visiting Datafloq. If you find our content interesting, please subscribe to our weekly newsletter:

Did you know that you can publish job posts for free on Datafloq? You can start immediately and find the best candidates for free! Click here to get started.

Not Now Subscribe

Thanks for visiting Datafloq
If you enjoyed our content on emerging technologies, why not subscribe to our weekly newsletter to receive the latest news straight into your mailbox?

Subscribe

No thanks

Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.

If you disable this cookie, we will not be able to save your preferences. This means that every time you visit this website you will need to enable or disable cookies again.

Marketing cookies

This website uses Google Analytics to collect anonymous information such as the number of visitors to the site, and the most popular pages.

Keeping this cookie enabled helps us to improve our website.

Please enable Strictly Necessary Cookies first so that we can save your preferences!