Querying MongoDB via Apache Pig
This crisp post in on querying MongoDB for HDFS Data Transfer, via Pig. Below are the steps involved for the same: 1. Install MongoDB on… Read More »Querying MongoDB via Apache Pig
This crisp post in on querying MongoDB for HDFS Data Transfer, via Pig. Below are the steps involved for the same: 1. Install MongoDB on… Read More »Querying MongoDB via Apache Pig
Quick post on HBase Exception: HBase create statement org.apache.hadoop.hbase.PleaseHoldException hbase(main):002:0> create ‘temptable’, ‘fam1’, ‘fam2’ ERROR: org.apache.hadoop.hbase.PleaseHoldException: org.apache.hadoop.hbase.PleaseHoldException: Master is initializing at org.apache.hadoop.hbase.master.HMaster.checkInitialized(HMaster.java:1841) at org.apache.hadoop.hbase.master.HMaster.createTable(HMaster.java:1333) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native… Read More »HBase create statement org.apache.hadoop.hbase.PleaseHoldException
Lot of time you need to connect to people personally but you do not actually know their emailid. GitHub is a social coding platform where… Read More »Get email id of user using GitHub userid
Data import in Hive by default expects a directory name in its query specified by LOCATION keyword. By default Hive picks up all the files… Read More »Hive – Selected data import/query – Files and folders (mapred.input.dir.recursive)
Here is an awesome post by Tim on the whole lifespan of a Drill Query. Nice read, highly recommended for all the fresh Drill’ers. http://tnachen.wordpress.com/2013/11/05/lifetime-of-a-query-in-drill-alpha-release/… Read More »Awesome post by Timothy Chen – Lifetime of a Query in Drill Alpha Release
Motivation for the post came from a friend who was having trouble finding way to use Drill with JSON Data source. So here comes a… Read More »Apache Drill – Executing sample SQL Queries on JSON Data
The last post here was on integrating Hive 0.11.0 with HBase 0.94.2. But because of issue HIVE-4515 currently we are not able to query HBase… Read More »Integrating Hive 0.9.0 with HBase 0.94.3 – Identifying root cause for RuntimeException: Error while reading from task log url
This is a crisp post on playing with the Open Source Speech Recognition Toolkit – PocketSphinx by Carnegie Mellon University. We need to install the… Read More »Speech Recognition in Python with CMU PocketSphinx (Ubuntu Box)
Came across the error while experimenting with pocketsphinx – CMU Speech Recognition Kit. Modules already installed by package manager: python-pocketsphinx pocketsphinx-hmm-wsj1 pocketsphinx-lm-wsj Python shell is… Read More »Python module ImportError: No module named pocketsphinx
There is a cool post here on Apache wiki : HBase Hive integration .This post is a simplified compilation of the same. Hive: 0.11.0 HBase:… Read More »HBase Hive integration – Querying HBase via Hive
What is a Logical Plan? Apache Drill creates two intermediate plans for its execution – The Logical plan & the Physical plan. The incoming query… Read More »Understanding Apache Drill Logical Plan
Came across this awesome video. Couldn’t help but share this. Bret Victor – Inventing on Principle