Skip to content

Tag: hadoop

Unable to submit concurrent Hadoop jobs

I am running Hadoop 2.7 on my local machine, along with HBase 1.4 and Phoenix 4.15. I have written an application which submits map reduce jobs that delete data in HBase through Phoenix. Each job is run by an individual thread of a ThreadPoolExecutor and looks like this: Everything is fine if there is only 1 …

jps command for Hadoop processes

there. I have hadoop 2.4.1 running on ubuntu. Executing jps command, I am getting this output: is it normal to get “3794 org.eclipse.equinox.launcher_1.5.0.v20180512-1130.jar” along with the output of jps? I am asking because didn’t get it before. Suddenly, it started to give this result wit…

Unable to connect to Phoenix using JDBC

I have a Hadoop Cluster set up with HBase and Phoenix and I’m trying to connect to Phoenix using JDBC, but I am sort of unable to get a successful connection. I want to use JDBC to connect using Python 3.x but as for simple test purposes I set up a connection using Java in Eclipse. I was originally usin…

How to do CopyMerge in Hadoop 3.0?

I know hadoop version 2.7’s FileUtil has the copyMerge function that merges multiple files into a new one. But the copyMerge function is no longer supported per the API in the 3.0 version. Any ideas on how to merge all files within a directory into a new single file in the 3.0 version of hadoop? Answer …