This article aims to explain a few common concepts and terms in the big data world for an audience that is not as technical as an engineering one but at the same time is at least familiar a bit with the big data space. The words are in no particular order, and a bit of technical language is used. Reach out if something is not clear! Relational database management … [Read more...] about The Most Important Big Data Concepts and What They Mean
hbase
Real-Time Kafka Data Ingestion into HBase via PySpark
Streaming data is becoming an essential part of every data integration project nowadays, if not a focus requirement, a second nature. Advantages gained from real-time data streaming are so many. To name a few: real-time analytics and decision making, better resource utilization, data pipelining, facilitation for micro-services and much more. Python has many modules out there … [Read more...] about Real-Time Kafka Data Ingestion into HBase via PySpark
Big Data Series
A smart approach to learn Big Data Analytics & Digital Transformation Big Data Analytics & Digital Transformation Workshops are designed for organisations to help them learn the necessary tools to create a roadmap for a comprehensive, future-proof digital transformation strategy. Led by world-class gurus, these workshops are hosting key decision makers and senior … [Read more...] about Big Data Series
How to Integrate Sqoop in a Data Ingestion Layer with Hive, Oozie in a Big Data Application?
Sqoop is a tool which helps to migrate and transfer the data between RDBMS and Hadoop system in bulk mode. This blog post will focus on integrating Sqoop with other projects in Hadoop ecosystem and Big Data Applications. As I am working for Big Data Solution providers, I learned it and here I will show how to schedule Sqoop job with Oozie and how to load the data from Sqoop to … [Read more...] about How to Integrate Sqoop in a Data Ingestion Layer with Hive, Oozie in a Big Data Application?
Introduction To Bulk Deletion Of Column Values In Hadoop Development With MapReduce
This article is about learning how to delete bulk column values by using Hbase bulk loading with Hadoop MapReduce. Proficient hadoop developers are sharing important things required for bulk column deletion in hadoop development. You can follow the steps shared by them to know how they do it. We are introducing a way to delete multiple column values by using hbasebulkloading … [Read more...] about Introduction To Bulk Deletion Of Column Values In Hadoop Development With MapReduce