Hive Connector Vs ODBC Connector

TNZL_BI · Post by **TNZL_BI** » Mon Apr 10, 2017 10:43 pm

Hi All ,

I have recently developed a job to connect to the Hive database in the Hadoop Ecosystem. Now I have used two methods to connect to the Hive database which are :-

1. ODBC Connector
2. Hive Connector

However , I am facing massive performance issues with the hive connector stage . Its taking hours to simply load some 80k rows where as when I use the ODBC connector stage , the performance is very good. We see this getting loaded in around 5 minutes time.

Does any one have an idea on this. Ideally the native connector stage should be faster and should have more options but in my case , the performance is really bad ...

Any inputs here will be very helpful.

TNZL_BI · Post by **TNZL_BI** » Sun Apr 30, 2017 6:00 pm

I have just got some patches to be installed on my services / engine tier as suggested by IBM . This may improve the speed. Will do that and then revert back with my findings

AnnDSX · Post by **AnnDSX** » Mon Mar 05, 2018 5:45 am

Hello,

Did you install the patches and see performance enhancement

Thanks

rkashyap · Post by **rkashyap** » Mon Mar 05, 2018 4:16 pm

Hive connector leverages JDBC connectivity.

We are using both ODBC Connector and Hive Connector for connect with Hive and have not seen much difference between the performance of the two.

AnnDSX · Post by **AnnDSX** » Mon Mar 05, 2018 10:58 pm

We are using the FileConnector for moving the files to HDFS and the performance is fair. However the performance of Hive connector is dismal.

The best that we could achieve was writing 1000 records in 20 minutes.