AWS Cloud Computing

Introduction to Simple Notification Service (Amazon SNS)

By Naveen P.N January 27, 2022

Continue Reading

Uncategorized

Kafka Commands

By Naveen P.N January 25, 2022

Continue Reading

Uncategorized

Getting Started with AWS

By Naveen P.N January 13, 2022

Continue Reading

Introduction to Simple Notification Service (Amazon SNS)

By Naveen P.N January 27, 2022AWS, Cloud Computing

Introduction to Simple Notification Service (Amazon SNS) In modern cloud architecture, applications are decoupled into smaller, independent building blocks that are easier to develop, deploy and maintain. A Publish/Subscribe (Pub/Sub)…

Kafka Commands

By Naveen P.N January 25, 2022Uncategorized

Kafka Commands Starting Zookeeper & Kafka [npntraining ~]$ cd \opt\kafka\zookeeper-3.5.6-bin\bin [npntraining ~]$ zkserver.sh Starting Kafka [npntraining ~]$ cd \opt\kafka\kafka_2.12-2.4.0\bin\ [npntraining ~]$ kafka-server-start.sh \opt\kafka\kafka_2.12-2.4.0\config\server.properties Kafka Topic You can create a new…

Getting Started with AWS

By Naveen P.N January 13, 2022Uncategorized

Getting Started with AWS Cloud computing is the delivery of computing services - including servers, storage, databases, networking, software, analytics -o ver the Internet (“the cloud”) Amazon Web Services is…

Memory Consumption of Hadoop NameNode

By Naveen P.N January 12, 2022Uncategorized

Each file or directory or block occupies about 150 bytes in the namenode memory. So a cluster with a namenode with 32G RAM can support a maximum of (assuming namenode…

Anatomy of File Read and Write

By Naveen P.N January 12, 2022Uncategorized

HDFS has a master and slave kind of architecture. Namenode acts as master and Datanodes as worker. All the metadata information is with namenode and the original data is stored…

Protected: Connecting to Hive Programatically

By Naveen P.N January 10, 2022Data Engineering, Hadoop, Hive

This content is password protected. To view it please go to the post page and enter the password.

Configure SQL Workbench and Query Hive

By Naveen P.N December 30, 2021AWS, Cloud Computing

In this blog post, I will show you how to connect to Hive using SQL Workbench. Download the SQL Workbench and Drivers Configure Drivers Launch SQLWorkbench64.exe Click on Manage Driver…

How to convert RDD to DataFrame

By Naveen P.N November 8, 2021Data Engineering, Data Science

In this blog post we will learn how to convert RDD to DataFrame with spark helper methods used in local development or testing.Converting RDD to Data FrameFirst let us create…

Hadoop File System Commands

By Naveen P.N November 8, 2021Hadoop, Hive, Java Programming, Selenium, Statistics

In this blog post I will explain different HDFS commands to access HDFS which are commonly used while working as a Big Data Developer Training. Hadoop provides command line interface…

What is Serverless Architecture

By Naveen P.N November 8, 2021Apache Spark, BDD-Cucumber, Cloud Computing, Kafka

In this blog post I will explain what is Serverless Architecture. Serverless architecture is a way to build and run applications and services without having to manage the infrastructure behind…

How To Upload a File with Selenium

By Naveen P.N November 8, 2021Kafka, Machine Learning

It is very common functionality to upload a file to the webserver. But when it involves in automating you get a dialog box that is just out of reach for…

Various Entry Points for Apache Spark

By Naveen P.N November 8, 2021Apache Spark, Data Engineering

In Data Engineering Apache Spark is probably one of the most popular framework to process huge volume of data. In this blog post I am going to cover the various entry points…