• Skip to main content
  • Skip to secondary menu
  • Skip to primary sidebar
  • Skip to footer
  • Articles
  • News
  • Events
  • Advertize
  • Jobs
  • Courses
  • Contact
  • (0)
  • LoginRegister
    • Facebook
    • LinkedIn
    • RSS
      Articles
      News
      Events
      Job Posts
    • Twitter
Datafloq

Datafloq

Data and Technology Insights

  • Categories
    • Big Data
    • Blockchain
    • Cloud
    • Internet Of things
    • Metaverse
    • Robotics
    • Security
    • Startups
    • Strategy
    • Technical
  • Big Data
  • Blockchain
  • Cloud
  • Metaverse
  • Internet Of things
  • Robotics
  • Security
  • Startups
  • Strategy
  • Technical

Data mining

Data mining (the analysis step of the “Knowledge Discovery in Databases” process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics, and database systems. The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use. Aside from the raw analysis step, it involves database and data management aspects, data pre-processing, model and inference considerations, interestingness metrics, complexity considerations, post-processing of discovered structures, visualization, and online updating. The term is a misnomer, because the goal is the extraction of patterns and knowledge from large amount of data, not the extraction of data itself. It also is a buzzword and is frequently applied to any form of large-scale data or information processing (collection, extraction, warehousing, analysis, and statistics) as well as any application of computer decision support system, including artificial intelligence, machine learning, and business intelligence. The popular book “Data mining: Practical machine learning tools and techniques with Java” (which covers mostly machine learning material) was originally to be named just “Practical machine learning”, and the term “data mining” was only added for marketing reasons. Often the more general terms “(large scale) data analysis”, or “analytics”
or when referring to actual methods, artificial intelligence and machine learning
are more appropriate. The actual data mining task is the automatic or semi-automatic analysis of large quantities of data to extract previously unknown interesting patterns such as groups of data records (cluster analysis), unusual records (anomaly detection) and dependencies (association rule mining). This usually involves using database techniques such as spatial indices. These patterns can then be seen as a kind of summary of the input data, and may be used in further analysis or, for example, in machine learning and predictive analytics. For example, the data mining step might identify multiple groups in the data, which can then be used to obtain more accurate prediction results by a decision support system. Neither the data collection, data preparation, nor result interpretation and reporting are part of the data mining step, but do belong to the overall KDD process as additional steps. The related terms data dredging, data fishing, and data snooping refer to the use of data mining methods to sample parts of a larger population data set that are (or may be) too small for reliable statistical inferences to be made about the validity of any patterns discovered. These methods can, however, be used in creating new hypotheses to test against the larger data populations.

Tweet
Share
Share
WhatsApp

Primary Sidebar

E-mail Newsletter

Sign up to receive email updates daily and to hear what's going on with us!

Publish
AN Article
Submit
a press release
List
AN Event
Create
A Job Post

Jobs

  • Clinical Quality Data Analyst – Telecommute in Eastern Time zone | Philadelphia County, PA, USA - August 07, 2022
  • Clinical Quality Data Analyst – Telecommute in Eastern Time zone | Phoenix, AZ, USA - August 07, 2022
  • Java Developer- Telecommute | Atlanta, GA, USA - August 07, 2022
  • Java Developer | Aston, GB - August 07, 2022
  • Future Opportunities – Salesforce Devops Architect (Remote/Travel) | UK, GB - August 07, 2022
More Jobs

Tags

AI Amazon analytics application Artificial Intelligence Azure benefits BI Big Data business Cloud company Covid-19 Data design development DevOps engineer engineering environment experience future government Group health information Java knowledge machine learning mobile news platform public requirements research security services share skills social software software engineer solutions technical technology

News

  • Musk challenges Twitter CEO to public debate on bots
  • Musk says Twitter deal should go ahead if it provides proof of real accounts
  • Trump social media deal can’t close on time, needs extension, buyer says
  • California appeals court rules no arbitration in Cisco caste bias case
  • Former Centerview banker sues firm for records over pay clash
More News

Footer


Datafloq is the one-stop source for big data, blockchain and artificial intelligence. We offer information, insights and opportunities to drive innovation with emerging technologies.

  • Facebook
  • LinkedIn
  • RSS
  • Twitter

Recent

  • Sentiment Analysis: Categories, Methods, and Use Cases
  • Moving to a Tokenized Economy: Challenges and Opportunities
  • What is a Helm Chart In Kubernetes?
  • The Problem of Bias in Artificial Intelligence
  • 6 Crucial Steps For Setting Your Data Team KPIs

Search

Tags

AI Amazon analytics application Artificial Intelligence Azure benefits BI Big Data business Cloud company Covid-19 Data design development DevOps engineer engineering environment experience future government Group health information Java knowledge machine learning mobile news platform public requirements research security services share skills social software software engineer solutions technical technology

Copyright © 2022 Datafloq
Privacy|Terms|Cookies

In order to optimize the website and to continuously improve Datafloq, we use cookies. For more information click here.

settings

Dear visitor,
Thank you for visiting Datafloq. If you find our content interesting, please subscribe to our weekly newsletter:

Did you know that you can publish job posts for free on Datafloq? You can start immediately and find the best candidates for free! Click here to get started.

Not Now Subscribe

Thanks for visiting Datafloq
If you enjoyed our content on emerging technologies, why not subscribe to our weekly newsletter to receive the latest news straight into your mailbox?

Subscribe

No thanks

Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.

If you disable this cookie, we will not be able to save your preferences. This means that every time you visit this website you will need to enable or disable cookies again.

Marketing cookies

This website uses Google Analytics to collect anonymous information such as the number of visitors to the site, and the most popular pages.

Keeping this cookie enabled helps us to improve our website.

Please enable Strictly Necessary Cookies first so that we can save your preferences!