• Skip to main content
  • Skip to secondary menu
  • Skip to primary sidebar
  • Skip to footer
  • Articles
  • News
  • Events
  • Advertize
  • Jobs
  • Courses
  • Contact
  • (0)
  • LoginRegister
    • Facebook
    • LinkedIn
    • RSS
      Articles
      News
      Events
      Job Posts
    • Twitter
Datafloq

Datafloq

Data and Technology Insights

  • Categories
    • Big Data
    • Blockchain
    • Cloud
    • Internet Of Things
    • Metaverse
    • Robotics
    • Cybersecurity
    • Startups
    • Strategy
    • Technical
  • Big Data
  • Blockchain
  • Cloud
  • Metaverse
  • Internet Of Things
  • Robotics
  • Cybersecurity
  • Startups
  • Strategy
  • Technical

What if Machine Learning as a Service Does Not Work?

Maxim Tereschenko / 6 min read.
December 4, 2017
Datafloq AI Score
×

Datafloq AI Score: 66.33

Datafloq enables anyone to contribute articles, but we value high-quality content. This means that we do not accept SEO link building content, spammy articles, clickbait, articles written by bots and especially not misinformation. Therefore, we have developed an AI, built using multiple built open-source and proprietary tools to instantly define whether an article is written by a human or a bot and determine the level of bias, objectivity, whether it is fact-based or not, sentiment and overall quality.

Articles published on Datafloq need to have a minimum AI score of 60% and we provide this graph to give more detailed information on how we rate this article. Please note that this is a work in progress and if you have any suggestions, feel free to contact us.

floq.to/78tRQ

In our previous article on Data Science Industry Perspectives in the Cloud, we’ve discussed that evolution is key if you plan to grow your business. You can start with the ready-made solutions, then, in time, you can switch to the in-house ones that can be done with the help of a group of data scientists.

This time around we’ll talk about the cases and solutions when ML as a Service doesn’t work. When this happens, your company shouldn’t start off with hiring al Data Scientist. The best option is to invest in a custom-made solution to solve your urgent business needs. Only then when you have a workable solution, can you dive into deep Data Science and create a proper team that can create an in-house Data Science solution.

Key Takeaways

  • The time when you don’t need to hire data scientists and when you should start investing in data science resources.
  • Start building a comprehensive Data Science product not from the research, but from the end-to-end solution that solves business problems.
  • People envisage a Data Scientist that has a balance of knowledge in relative subject matters, but in real life, you can barely find such an ideal candidate.
  • Make your Data Scientists successful and productive. Any team that deals with Data Science has to be cross-functional with adjacent roles contributing to the end solution.
  • Deliver end-to-end solutions that solve business problems rather than research papers.

Custom made end-to-end solution as a start

Here we should talk about the classical Data Science, where you have data, goals and you need to build models to solve a pressing issue. The best way to do this is to jumpstart such a process by putting together bits and pieces of some ready-made services into a single workable product and show your customer a clear-cut result shortly. This can be done without any complex or global research, and you can comfortably formulate specifications taking into account all the feedback from your customer and create a more high-level Data Science product in the long run.

machine learning

One of the biggest issues for any Data Science project is in formulating the specifications for it. The usual request is Create something for my company using Data Science. Analyze data for me. This type of a job has lots of trials and errors. And having some custom basic end-to-end solution from the start lets you insert into it the needed extra services on the go. This lets us provide insights and predictions into a specific business workflow much easier.

I believe that you should start to build your system from the ground up not from the Data Science research, but from the point of view of an end-to-end solution. And then bit by bit you can take out and insert the required services in the process.

machine learning

How to do Data Science research?

You start with employing a Data Scientist that matches your company’s needs. You can use the standard Data Scientist chart: this employee should have a firm footing in the application environment, mathematics, and programming. Their business skills are key to success. I believe that an understanding of the topical area is key here because we are solving business issues. And the Data Scientist that solves more academic problems will be more focused on winning a Kaggle competition than addressing the business needs. It is important that this person understands the product development cycle. This way he builds up models and analyses data in such a way that it would be possible to be deployed in production.

For example, if a person is using R (language and framework for Data Scientists) we should note that it is more aimed at research and it is not production ready. Correspondingly the results of such research cannot be deployed in any end-to-end solution. Therefore, he should take note of this and work in pair with a programmer. Although in the above diagram we see that the Data Scientist should have hacking skills, in reality, this is not the case. In their vast majority, Data Scientists are not able to write quality code. And if you need not just the research but a solution then process-wise you need to have a Data Scientist working together with a Data Engineer.


Interested in what the future will bring? Download our 2023 Technology Trends eBook for free.

Consent

machine learning

CAP theory as analogy

It is impossible to have three database properties at once: consistency, availability, and partition-tolerance. This is the basic rule know to all developers. And this is true for a Data Scientist as well, as people envisage a Data Scientist on the overlap of these subjects, but in real life, you cannot find such an ideal candidate. Usually, people tend to lean one or another way in their work and keeping a balance is not always a priority.

In principle one of the solutions that we use ourselves is that a Data Scientist should have business insights, understand the math behind it and work together in tandem with a skilled developer. Of course, a proper Data Scientist should be able to write any semblance of code. But a Data Scientist should work with a Data Engineer, write and realise code being both responsible for the quality and sustainability of this solution.

The classic tragedy of a company that decides to initiate any Data Science research is in when a Data Scientist says I’ve got 40000 lines of Python code on my PC, can you make it work in production? And, of course, this is virtually impossible to do. So you have an issue at hand that all of the research is simply wasted.

machine learning

Cross-functional teams

Any team that deals with Data Science has to be cross-functional, i.e. it has to cover a whole stack of the solutions it writes. In a normal infrastructure there should be present a DevOps Engineer, Data Scientist, Data Engineer, and a Product Developer writing the web app and/or mobile app. And this is a single team that is responsible for the result. They should work together and solve related tasks that are interconnected in their interactions.

All of this means that the whole team is responsible for the business result. This is also true for the transitionary research done by a Data Scientist which is impossible to use in production on its own.

machine learning

Old-school vs. Vertical teams

To dig in deeper, let’s take a classic old-school layered company organisation structure when you have a department of Data Scientists, Operations, UI Developers, Big Data department, QA Engineers and so on. In this case, we have every project penetrating most of these teams. And the classic problem is that tickets and tasks are being thrown around by one team to another, and the real business goals are being watered down along the way and not solved in the end. So instead of this horizontal division, we have divided the teams vertically. This allowed us to create teams that see a clear-cut goal they need to achieve. And at the same time, they can improve their cross-skills, and boost their responsibility levels.

As a result, such teams began to deliver, Scrum and Agile began to work properly. It is not directly related to Data Science, but nevertheless, there is a standard mistake of many companies where Data Scientists work somewhere at a university and write mostly academic papers. It is a topic for a whole new article, but for now, you need to distinguish that there is a Data Scientist and a Production Data Scientist. And you should aim at employing the latter one within your teams, and not let a Data Scientist work alone remotely.

Categories: Artificial Intelligence
Tags: Big Data, machine learning, research

About Maxim Tereschenko

Maxim Tereschenko is the Head of BI / Big Data Practice at Squadex.com. Led global consulting and outsourcing data projects. Managed Connectivity area for Zoomdata, one of the fastest growing Big Data analytic platforms. Built comprehensive analytical solutions for the FinTech sector from scratch. A true believer that proper data analysis is a key to significant improvement of business operations, and maybe even the World. Connect with him on LinkedIn.

Primary Sidebar

E-mail Newsletter

Sign up to receive email updates daily and to hear what's going on with us!

Publish
AN Article
Submit
a press release
List
AN Event
Create
A Job Post

Related Articles

The Advantages of IT Staff Augmentation Over Traditional Hiring

May 4, 2023 By Mukesh Ram

The State of Digital Asset Management in 2023

May 3, 2023 By pimcoremkt

Test Data Management – Implementation Challenges and Tools Available

May 1, 2023 By yash.mehta262

Related Jobs

  • Software Engineer | South Yorkshire, GB - February 07, 2023
  • Software Engineer with C# .net Investment House | London, GB - February 07, 2023
  • Senior Java Developer | London, GB - February 07, 2023
  • Software Engineer – Growing Digital Media Company | London, GB - February 07, 2023
  • LBG Returners – Senior Data Analyst | Chester Moor, GB - February 07, 2023
More Jobs

Tags

AI Amazon analysis analytics application Artificial Intelligence BI Big Data business China Cloud Companies company crypto customers Data design development digital engineer engineering environment experience future Google+ government Group health information learning machine learning mobile news public research security services share skills social social media software solutions strategy technology

Related Events

  • 6th Middle East Banking AI & Analytics Summit 2023 | Riyadh, Saudi Arabia - May 10, 2023
  • Data Science Salon NYC: AI & Machine Learning in Finance & Technology | The Theater Center - December 7, 2022
  • Big Data LDN 2023 | Olympia London - September 20, 2023
More events

Related Online Courses

  • Oracle Cloud Data Management Foundations Workshop
  • Data Science at Scale
  • Statistics with Python
More courses

Footer


Datafloq is the one-stop source for big data, blockchain and artificial intelligence. We offer information, insights and opportunities to drive innovation with emerging technologies.

  • Facebook
  • LinkedIn
  • RSS
  • Twitter

Recent

  • 5 Reasons Why Modern Data Integration Gives You a Competitive Advantage
  • 5 Most Common Database Structures for Small Businesses
  • 6 Ways to Reduce IT Costs Through Observability
  • How is Big Data Analytics Used in Business? These 5 Use Cases Share Valuable Insights
  • How Realistic Are Self-Driving Cars?

Search

Tags

AI Amazon analysis analytics application Artificial Intelligence BI Big Data business China Cloud Companies company crypto customers Data design development digital engineer engineering environment experience future Google+ government Group health information learning machine learning mobile news public research security services share skills social social media software solutions strategy technology

Copyright © 2023 Datafloq
HTML Sitemap| Privacy| Terms| Cookies

  • Facebook
  • Twitter
  • LinkedIn
  • WhatsApp

In order to optimize the website and to continuously improve Datafloq, we use cookies. For more information click here.

settings

Dear visitor,
Thank you for visiting Datafloq. If you find our content interesting, please subscribe to our weekly newsletter:

Did you know that you can publish job posts for free on Datafloq? You can start immediately and find the best candidates for free! Click here to get started.

Not Now Subscribe

Thanks for visiting Datafloq
If you enjoyed our content on emerging technologies, why not subscribe to our weekly newsletter to receive the latest news straight into your mailbox?

Subscribe

No thanks

Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.

If you disable this cookie, we will not be able to save your preferences. This means that every time you visit this website you will need to enable or disable cookies again.

Marketing cookies

This website uses Google Analytics to collect anonymous information such as the number of visitors to the site, and the most popular pages.

Keeping this cookie enabled helps us to improve our website.

Please enable Strictly Necessary Cookies first so that we can save your preferences!