Skip to content

ConfusedCoders

the world is opensource
  • data engineering
    • ETL
      • hive
      • pig
    • distributed systems
      • spark
      • hadoop
      • hbase
      • apache drill
    • airflow
    • data storage
      • mongodb
    • search
      • solr
  • data science
    • deep learning
    • machine learning
    • visualization
  • general programming
    • whitepapers
    • open source
    • mobile
    • raspberrypi
    • data structures
    • golang
    • java
      • design patterns
      • hibernate
    • random
      • life
  • about us
    • Nikita Sharma | Data Science Student

ConfusedCoders

the world is opensource
  • data engineering
    • ETL
      • hive
      • pig
    • distributed systems
      • spark
      • hadoop
      • hbase
      • apache drill
    • airflow
    • data storage
      • mongodb
    • search
      • solr
  • data science
    • deep learning
    • machine learning
    • visualization
  • general programming
    • whitepapers
    • open source
    • mobile
    • raspberrypi
    • data structures
    • golang
    • java
      • design patterns
      • hibernate
    • random
      • life
  • about us
    • Nikita Sharma | Data Science Student

Real world application project for Big Data – with Apache Spark and AWS-EMR

  • Nikita Sharma Nikita Sharma
  • February 24, 2019February 24, 2019
  • airflow, AWS, data engineering, data storage, ETL, spark

Hey readers, I am learning Data Engineering from last few months and I thought of sharing my learning with you all. Recently I made a project… Read More »Real world application project for Big Data – with Apache Spark and AWS-EMR

Data Engineering Part 2 – Productionizing Big data ETL with Apache Airflow

  • Nikita Sharma Nikita Sharma
  • February 11, 2019February 11, 2019
  • airflow, data engineering, ETL

Hey readers, in previous post I have explained How to create a python ETL Project. In this post, I will explain how we can schedule/productionize our… Read More »Data Engineering Part 2 – Productionizing Big data ETL with Apache Airflow

I am starting my Masters in Data Science at UTS

  • Nikita Sharma Nikita Sharma
  • February 2, 2019February 2, 2019
  • data science, life

  I am so excited to start this journey. I am hoping to learn a lot of concepts and meet a lot of awesome people.… Read More »I am starting my Masters in Data Science at UTS

CNN with TensorFlow for Deep Learning Beginners

  • Nikita Sharma Nikita Sharma
  • January 29, 2019January 29, 2019
  • data science, deep learning

Hey Readers, this is my first blog on Deep Learning.  From last few months I have started learning  about Deep learning. Thanks to guru99 where… Read More »CNN with TensorFlow for Deep Learning Beginners

Data Engineering Part 1 – How to become a Big Data Engineer

  • Nikita Sharma Nikita Sharma
  • January 15, 2019February 11, 2019
  • AWS, data engineering, ETL, hive, spark

Hey Readers, I am a Data Science Student and recently I have started learning more about Data Engineering. Data Science and Data Engineering teams co-exist… Read More »Data Engineering Part 1 – How to become a Big Data Engineer

Query S3 data via Hive on local box

  • Nikita Sharma Nikita Sharma
  • December 28, 2018December 29, 2018
  • hive

In the last post we discussed about how to generate synthetic data. Here we will talk about how to query S3 data via Hive. Provide… Read More »Query S3 data via Hive on local box

How to generate synthetic log data for data analysis

  • Nikita Sharma Nikita Sharma
  • December 28, 2018December 29, 2018
  • AWS

A lot of time, we want some synthetic data to start our journey on data analysis. In this post we will discuss how to generate… Read More »How to generate synthetic log data for data analysis

Handson SQL guide for Data Science beginners – From databases to data lakes

  • Nikita Sharma Nikita Sharma
  • December 25, 2018December 25, 2018
  • general programming

Are you an aspiring Data Scientist, or a greenhorn Data Science student like me ? Are you trying to start with SQL and are lost… Read More »Handson SQL guide for Data Science beginners – From databases to data lakes

Getting started with MySQL and MySQL Workbench

  • Nikita Sharma Nikita Sharma
  • December 24, 2018December 24, 2018
  • general programming

This post is going to take you through –  how to install and query  MYSQL and MySQL Workbench on your local box. Let’s get started Let’s… Read More »Getting started with MySQL and MySQL Workbench

Public Speaking at Web Analytics Wednesday Meetup, Sydney

  • Nikita Sharma Nikita Sharma
  • December 12, 2018December 12, 2018
  • public speaking, spark

  It was an eventful evening yesterday when I gave my first ever talk. I talked on Analytics with Apache Spark and Zeppelin on Amazon… Read More »Public Speaking at Web Analytics Wednesday Meetup, Sydney

My secret recipe for Kaggle competition for top 8% on leaderboard

  • Nikita Sharma Nikita Sharma
  • December 9, 2018
  • data science, machine learning, visualization

Kaggle is a platform to explore our knowledge on Data Science, to learn new techniques or modelling from the experts. Kaggle competitions are the best way to… Read More »My secret recipe for Kaggle competition for top 8% on leaderboard

Exploratory Data Analysis (EDA) techniques for kaggle competition beginners

  • Nikita Sharma Nikita Sharma
  • November 25, 2018
  • data science, visualization

Exploratory Data Analysis (EDA) is an approach to analysing data sets to summarize their main characteristics, often with visual methods. Following are the different steps involved… Read More »Exploratory Data Analysis (EDA) techniques for kaggle competition beginners

  • « Previous
  • 1
  • 2
  • 3
  • 4
  • 5
  • …
  • 12
  • Next »

Neve | Powered by WordPress