Installing Solr on ubuntu
Here is a quick dirty post on installing SOLR on your box. Hope its helpful. Download SOLR Get new Solr copy. I got my copy… Read More »Installing Solr on ubuntu
Here is a quick dirty post on installing SOLR on your box. Hope its helpful. Download SOLR Get new Solr copy. I got my copy… Read More »Installing Solr on ubuntu
This is an initial work on supporting Pig scripts on Drill. It extends the PigServer to parse the Pig Latin script and to get a… Read More »How to run pig latin scripts on apache drill
The error pops up while using mahout collab filtering on Hadoop 2. Exception in thread “main” java.lang.IncompatibleClassChangeError: Found interface org.apache.hadoop.mapreduce.JobContext, but class was expected at… Read More »Mahout usage IncompatibleClassChangeError Exception
Error in acquireLock… FAILED: Error in acquiring locks: Locks on the underlying objects cannot be acquired. Instant Patchy workaround: SET hive.support.concurrency=false; Unlock table: unlock table… Read More »Hive Hangs unexpectedly and ends up with : Error in acquireLock..
Quickly scribbled a function to get a plain csv out of mongo db json object. Use the script as you would call any shell script.… Read More »How to convert mongo db json object to csv file
This came as a pleasant surprise to me today when I found that Apache Drill now also has an embedded Jetty-Jersey based REST service interface exposed for tracking the status of the Drillbit along with the status of submitted queries.
The interface can be checked out here once the Drillbit is running: http://localhost:8047/statusRead More »Apache Drill – REST Support
Implement Drill Trigonometric functions – Using Freemarker code generation This post is a followup to this last post Contributing to Apache Drill – Math Functions.… Read More »Contributing to Apache Drill – Part 2 : Freemarker Code gen implementation
Hive fixed length serde can be used in scenarios where we do not have any delimiters in out data file. Using RegexSerDe for fixed length strings… Read More »Use Hive Serde for Fixed Length (index based) strings
This crisp post in on querying MongoDB for HDFS Data Transfer, via Pig. Below are the steps involved for the same: 1. Install MongoDB on… Read More »Querying MongoDB via Apache Pig
Quick post on HBase Exception: HBase create statement org.apache.hadoop.hbase.PleaseHoldException hbase(main):002:0> create ‘temptable’, ‘fam1’, ‘fam2’ ERROR: org.apache.hadoop.hbase.PleaseHoldException: org.apache.hadoop.hbase.PleaseHoldException: Master is initializing at org.apache.hadoop.hbase.master.HMaster.checkInitialized(HMaster.java:1841) at org.apache.hadoop.hbase.master.HMaster.createTable(HMaster.java:1333) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native… Read More »HBase create statement org.apache.hadoop.hbase.PleaseHoldException
Data import in Hive by default expects a directory name in its query specified by LOCATION keyword. By default Hive picks up all the files… Read More »Hive – Selected data import/query – Files and folders (mapred.input.dir.recursive)
Here is an awesome post by Tim on the whole lifespan of a Drill Query. Nice read, highly recommended for all the fresh Drill’ers. http://tnachen.wordpress.com/2013/11/05/lifetime-of-a-query-in-drill-alpha-release/… Read More »Awesome post by Timothy Chen – Lifetime of a Query in Drill Alpha Release