Skip to content
ConfusedCoders
  • data engineering
    • ETL
      • hive
      • pig
    • distributed systems
      • spark
      • hadoop
      • hbase
      • apache drill
    • airflow
    • data storage
      • mongodb
    • search
      • solr
  • data science
    • deep learning
    • machine learning
    • visualization
  • general programming
    • whitepapers
    • open source
    • mobile
    • raspberrypi
    • data structures
    • golang
    • java
      • design patterns
      • hibernate
    • random
      • life
  • about us
    • Nikita Sharma | Data Science Student

data-cleaning

My secret recipe for Kaggle competition for top 8% on leaderboard

December 9, 2018 Nikita Sharmadata science, machine learning, visualization

Kaggle is a platform to explore our knowledge on Data Science, to learn new techniques or modelling from the experts. Kaggle competitions […]

Read more

Exploratory Data Analysis (EDA) techniques for kaggle competition beginners

November 25, 2018 Nikita Sharmadata science, visualization

Exploratory Data Analysis (EDA) is an approach to analysing data sets to summarize their main characteristics, often with visual methods. Following […]

Read more

Cleaning data for data visualisation

October 18, 2018 Nikita Sharmadata science, visualization

This small post provides information on cleaning data by dealing  with missing data present in a dataframe. Data cleaning is […]

Read more

Recent Posts

  • Data Engineering Part 2 – Productionizing Big data ETL with Apache Airflow
  • I am starting my Masters in Data Science at UTS
  • CNN with TensorFlow for Deep Learning Beginners
  • Data Engineering Part 1 – How to become a Big Data Engineer
  • Query S3 data via Hive on local box

Recent Comments

  • Madars Vitolins on Create a basic distributed system in Go lang – Part 1
  • Anushka Mudholkar on How to view content of parquet files on S3/HDFS from Hadoop cluster using parquet-tools
  • Radha on How to install Appium in Ubuntu
  • STP on Setup PyCharm for Deep learning with TensorFlow, Keras and Jupyter (with virtualenv)
  • Shyam on How to view content of parquet files on S3/HDFS from Hadoop cluster using parquet-tools

We love to hear back



Tweet

  • about us
  • Nikita Sharma | Data Science Student
Powered by WordPress | Theme: Astrid by aThemes.