ConfusedCoders

the world is opensource
  • data engineering
    • ETL
      • hive
      • pig
    • distributed systems
      • spark
      • hadoop
      • hbase
      • apache drill
    • airflow
    • data storage
      • mongodb
    • search
      • solr
  • data science
    • deep learning
    • machine learning
    • visualization
  • general programming
    • whitepapers
    • open source
    • mobile
    • raspberrypi
    • data structures
    • golang
    • java
      • design patterns
      • hibernate
    • random
      • life
  • about us
    • Nikita Sharma | Data Science Student

Dynamic Sorting Utility

  • debargho chatterjee
  • January 3, 2015 (updated April 9, 2015)
  • java, random

Hi friends! Recently I came across a requirement wherein I was supposed to sort a custom class based on multiple parameters. Problem Statement…

Installing Solr on ubuntu

  • Yash Sharma
  • November 12, 2014
  • solr

Here is a quick and dirty post on installing Solr on your box. Hope it's helpful. Download Solr: get a new Solr copy. I got my copy…

Python error while indexing data in Solr – PySolr Error – 2

  • Yash Sharma
  • October 27, 2014 (updated October 11, 2015)
  • random

Error:

Traceback (most recent call last):
  File "index_data.py", line 47, in <module>
    main(sys.argv[1:])
  File "index_data.py", line 41, in main
    s.add(items, commit=True)
  File "/usr/lib/python2.6/site-packages/pysolr.py", line 779, in…

Python error while indexing data in Solr – PySolr Error – 1

  • Yash Sharma
  • October 27, 2014 (updated October 11, 2015)
  • random

Error:

Traceback (most recent call last):
  File "index_data.py", line 47, in <module>
    main(sys.argv[1:])
  File "index_data.py", line 41, in main
    s.add(items, commit=True)
  File "/usr/lib/python2.6/site-packages/pysolr.py", line 779, in…

How to run pig latin scripts on apache drill

  • Yash Sharma
  • October 22, 2014
  • apache drill

This is initial work on supporting Pig scripts on Drill. It extends the PigServer to parse the Pig Latin script and to get a…

Mahout usage IncompatibleClassChangeError Exception

  • Yash Sharma
  • October 22, 2014
  • data engineering, machine learning

The error pops up while using Mahout collaborative filtering on Hadoop 2.

Exception in thread "main" java.lang.IncompatibleClassChangeError: Found interface org.apache.hadoop.mapreduce.JobContext, but class was expected at…

Hive Hangs unexpectedly and ends up with : Error in acquireLock..

  • Yash Sharma
  • October 22, 2014
  • data engineering

Error in acquireLock…

FAILED: Error in acquiring locks: Locks on the underlying objects cannot be acquired.

Instant patchy workaround: SET hive.support.concurrency=false;

Unlock table: unlock table…

How to convert mongo db json object to csv file

  • Yash Sharma
  • October 22, 2014
  • mongodb, random

I quickly scribbled a function to get a plain CSV out of a MongoDB JSON object. Use the script as you would call any shell script…

Apache Drill – REST Support

  • Yash Sharma
  • June 1, 2014 (updated June 2, 2014)
  • 1 Comment
  • apache drill

This came as a pleasant surprise to me today when I found that Apache Drill now also has an embedded Jetty-Jersey based REST service interface exposed for tracking the status of the Drillbit along with the status of submitted queries.

The interface can be checked out here once the Drillbit is running: http://localhost:8047/status
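Here is a minimal sketch of checking that status URL from a script instead of a browser. It only assumes the URL quoted above and that a running Drillbit answers it with HTTP 200; the helper names `fetch_status` and `is_drillbit_up` are made up for this example, and nothing is assumed about what the page body contains.

```python
import urllib.request

# URL taken from the post; adjust host/port if your Drillbit runs elsewhere.
DRILL_STATUS_URL = "http://localhost:8047/status"


def fetch_status(url=DRILL_STATUS_URL, opener=urllib.request.urlopen, timeout=5):
    """Return (http_status_code, body_text) for the Drillbit status page.

    `opener` is injectable so the function can be exercised without a
    running Drillbit (hypothetical helper, not part of Drill).
    """
    with opener(url, timeout=timeout) as resp:
        body = resp.read().decode("utf-8", errors="replace")
        return resp.status, body


def is_drillbit_up(url=DRILL_STATUS_URL, opener=urllib.request.urlopen):
    """True if the status page answers with HTTP 200, False otherwise."""
    try:
        code, _ = fetch_status(url, opener)
    except OSError:
        # urllib's URLError/HTTPError are OSError subclasses:
        # connection refused, timeout, non-2xx response, and so on.
        return False
    return code == 200
```

Calling `is_drillbit_up()` with no arguments hits the default URL, so it should return False when nothing is listening on port 8047.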

Contributing to Apache Drill – Part 2 : Freemarker Code gen implementation

  • Yash Sharma
  • May 25, 2014 (updated May 26, 2014)
  • 2 Comments
  • apache drill

Implement Drill trigonometric functions using Freemarker code generation. This post is a follow-up to the previous post, Contributing to Apache Drill – Math Functions…

Cross platform encryption decryption using Java/C#

  • Yash Sharma
  • May 12, 2014 (updated May 29, 2014)
  • 8 Comments
  • random

Encryption and decryption have been very important modules for any enterprise application. Whether it is a file on our system or the…

Use Hive Serde for Fixed Length (index based) strings

  • Yash Sharma
  • May 12, 2014
  • 2 Comments
  • hive

The Hive fixed-length SerDe can be used in scenarios where we do not have any delimiters in our data file. Using RegexSerDe for fixed length strings…
