• Skip to main content
  • Skip to secondary menu
  • Skip to primary sidebar
  • Skip to footer
  • Articles
  • News
  • Events
  • Advertize
  • Jobs
  • Courses
  • Contact
  • (0)
  • LoginRegister
    • Facebook
    • LinkedIn
    • RSS
      Articles
      News
      Events
      Job Posts
    • Twitter
Datafloq

Datafloq

Data and Technology Insights

  • Categories
    • Big Data
    • Blockchain
    • Cloud
    • Internet Of Things
    • Metaverse
    • Robotics
    • Cybersecurity
    • Startups
    • Strategy
    • Technical
  • Big Data
  • Blockchain
  • Cloud
  • Metaverse
  • Internet Of Things
  • Robotics
  • Cybersecurity
  • Startups
  • Strategy
  • Technical

Predictive Analytics for Beginners – How to Get Started

Aleksandra Besińska / 3 min read.
October 21, 2015
Datafloq AI Score
×

Datafloq AI Score: 73.67

Datafloq enables anyone to contribute articles, but we value high-quality content. This means that we do not accept SEO link building content, spammy articles, clickbait, articles written by bots and especially not misinformation. Therefore, we have developed an AI, built using multiple built open-source and proprietary tools to instantly define whether an article is written by a human or a bot and determine the level of bias, objectivity, whether it is fact-based or not, sentiment and overall quality.

Articles published on Datafloq need to have a minimum AI score of 60% and we provide this graph to give more detailed information on how we rate this article. Please note that this is a work in progress and if you have any suggestions, feel free to contact us.

floq.to/Rip0P

Data is everywhere. We generate data when using an ATM, browsing the Internet, calling our friends, buying shoes in our favourite e-shop or posting on Facebook. Companies collect this data en masse in order to make more informed business decisions, such as:

  • Which customers should participate in our promotional campaign for a given product in order to maximize response?
  • Which customers should be paid special attention to, as they might be considering resigning from using our services?
  • Is a particular customer trustworthy and does he/she qualify for a mortgage loan?

It is not always easy to obtain answers to questions like the ones above. In such situations it is worthwhile to resort to predictive analytics, which provides valuable information that can help make the right decisions. In simple terms, predictive analytics let us predict the future on the basis of historical data.

For example, if we know which customers stopped using our products in the past, we can build an analytical model describing the patterns of their behavior and characteristics. If we observe similar behavior in other customers, especially the ones which are the most valuable to us (as they generate the largest sales), we may try to prevent them from departing.Predictive analytics will provide us with the ranking of our customers according to their risk of departure (this is the so called score in our example, the higher the score, the higher the departure risk).

However, to build such an analytical model we need historical data

Source Data

Most often the data required for modeling are obtained from databases or flat files. Below, we discuss an example table with source data and ways of interpreting it.

Predictive-Analytics-for-Beginners-table

Table columns are referred to as variables or attributes, while table rows are called records, observations or objects.

Variables can be:

  • numerical (also called quantitative or continuous)  for example age, income, temperature,
  • categorical (also called discrete, qualitative, nominal)  such as gender, occupation, eye color.

We distinguish two basic roles a variable can have:


Interested in what the future will bring? Download our 2023 Technology Trends eBook for free.

Consent

  • independent (also called predictor, explanatory, feature)  these variables describe the properties of objects which we want to use as the basis for making inferences,
  • dependent (also called response, explained, target)  these variables describe the features of the object which we want to make inferences about.

It is worth remembering, that information about the target variable should not be used when calculating the values of explanatory variables.

Depending on the industry, task, there can be a lot of variables available for analysis. We have worked on databases that had tens of thousands of variables. In addition, variable names are not always understandable. For example, who would guess that POP901, MARR1, IC10 means respectively number of persons, the percentage of married, the percentage of households with an income $ 50,000 $ 74.999 ? Therefore, during data analysis one should have a data dictionary with variables description.

Analytical Model

A predictive model describes the dependencies between explanatory variables and the target. It lets us to predict the target value on the basis of explanatory variables. There are many types of models. The most popular ones include:

  • regression (with the dependency expressed using a mathematical formula). An example:

Predictive-Analytics-for-Beginners-regression

  • decision tree (where the dependency is encoded using a tree-resembling graph). An example:

Predictive-Analytics-for-Beginners-decision-tree

Models can have the following roles:

  • classification  the target variable is discrete (i.e. decision trees, logistic regression),
  • approximation  the target is continuous (i.e. linear regression, neural networks),
  • association  co-occurrence of values (i.e. A-Priori algorithms, associative networks),
  • segmentation  division into subgroups (i.e. k-means algorithm, Kohonen networks).

In our next post we are going to focus on the model building process. We will show it on an example of classification task. Classification is the process of assigning every object from a collection to exactly one class from a known set of classes. Examples of classification tasks are: assigning a patient (the object) to a group of healthy or ill (the classes) people on the basis of his or her medical record or determining the customers (the object) credibility during credit application using, for example, demographic and financial data; in this case the classes are credible and not credible.

The third part will be devoted to the notions of scoring and cut-off point.

Categories: Big Data
Tags: analytics, Big Data, business, predictive analytics, regression, segmenting

About Aleksandra Besińska

Aleksandra Besi ska is a Marketing Director at Algolytics - a Polish company offering solutions for Predictive Analytics, Data Quality, Social Network Analysis and other advanced data analysis tasks. She is responsible for Algolytics marketing strategy and activities as well as sales support. Graduate of two among most prestigious universities in Poland - Warsaw School of Economics and Warsaw University, always wanted to work in the area that would combine both marketing and new technologies.

10 years of experience gained while working for Algolytics enabled her to discovered not only how enormous potential is hidden in data but also the diversity of applications predictive analytics has. She is happy to have the opportunity to offer solutions that help companies grow by using valuable knowledge extracted from the gathered data. Mother of a beautiful 2 year old daughter, she can't imagine a day without a cup of good coffee.

Primary Sidebar

E-mail Newsletter

Sign up to receive email updates daily and to hear what's going on with us!

Publish
AN Article
Submit
a press release
List
AN Event
Create
A Job Post

Related Articles

What is Enterprise Application Integration (EAI), and How Should Your Company Approach It?

March 29, 2023 By Terry Wilson

5 Best Data Engineering Projects & Ideas for Beginners

March 29, 2023 By emily.joe685

Data Centre World Asia

March 29, 2023 By r.chan

Related Jobs

  • Software Engineer | South Yorkshire, GB - February 07, 2023
  • Software Engineer with C# .net Investment House | London, GB - February 07, 2023
  • Senior Java Developer | London, GB - February 07, 2023
  • Software Engineer – Growing Digital Media Company | London, GB - February 07, 2023
  • LBG Returners – Senior Data Analyst | Chester Moor, GB - February 07, 2023
More Jobs

Tags

AI Amazon analysis analytics app application Artificial Intelligence BI Big Data blockchain business China Cloud Companies company costs crypto customers Data development digital environment experience finance future Google+ government information learning machine learning market mobile Musk news public research security share social social media software startup strategy technology twitter

Related Events

  • 6th Middle East Banking AI & Analytics Summit 2023 | Riyadh, Saudi Arabia - May 10, 2023
  • Data Science Salon NYC: AI & Machine Learning in Finance & Technology | The Theater Center - December 7, 2022
  • Big Data LDN 2023 | Olympia London - September 20, 2023
More events

Related Online Courses

  • Big Data & AI World, Singapore
  • Finding Your Professional Voice: Confidence & Impact
  • Creating a Differential Competitive Advantage -Jagdish Sheth
More courses

Footer


Datafloq is the one-stop source for big data, blockchain and artificial intelligence. We offer information, insights and opportunities to drive innovation with emerging technologies.

  • Facebook
  • LinkedIn
  • RSS
  • Twitter

Recent

  • How to Validate OpenAI GPT Model Performance with Text Summarization (Part 1)
  • What is Enterprise Application Integration (EAI), and How Should Your Company Approach It?
  • 5 Best Data Engineering Projects & Ideas for Beginners
  • Personalization Vs. Hyper-Personalization: Benefits, Limitations and Potential
  • Explaining data products lifecycle and their scope in management

Search

Tags

AI Amazon analysis analytics app application Artificial Intelligence BI Big Data blockchain business China Cloud Companies company costs crypto customers Data development digital environment experience finance future Google+ government information learning machine learning market mobile Musk news public research security share social social media software startup strategy technology twitter

Copyright © 2023 Datafloq
HTML Sitemap| Privacy| Terms| Cookies

  • Facebook
  • Twitter
  • LinkedIn
  • WhatsApp

In order to optimize the website and to continuously improve Datafloq, we use cookies. For more information click here.

settings

Dear visitor,
Thank you for visiting Datafloq. If you find our content interesting, please subscribe to our weekly newsletter:

Did you know that you can publish job posts for free on Datafloq? You can start immediately and find the best candidates for free! Click here to get started.

Not Now Subscribe

Thanks for visiting Datafloq
If you enjoyed our content on emerging technologies, why not subscribe to our weekly newsletter to receive the latest news straight into your mailbox?

Subscribe

No thanks

Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.

If you disable this cookie, we will not be able to save your preferences. This means that every time you visit this website you will need to enable or disable cookies again.

Marketing cookies

This website uses Google Analytics to collect anonymous information such as the number of visitors to the site, and the most popular pages.

Keeping this cookie enabled helps us to improve our website.

Please enable Strictly Necessary Cookies first so that we can save your preferences!