cancel
Showing results for 
Search instead for 
Did you mean: 

BO PERFORMANCE ISSUE WITH CLOUDERA IMPALA HADOOP

0 Kudos

Hi Experts,


Hope you all are doing well. I want to take some expert opinion here: We are switching from Oracle to Hadoop due to slow performance with Oracle DB, built a universe with Cloudera Simba ODBC connections scheduled a report expecting a faster performance compare to Oracle DB but the report took more than 2 hours, took the same query and ran in HUE SQL editor the result got back in less than 2 mins

We tested in DEV, TEST, & PROD, & also tried switching to JDBC connection little improvement in performance, we feel its the network's latency issue. Points to note here that our Hadoop servers and BO servers are in two different locations NCAL and SCAL, we have 3.5 million records to pull

I am looking for some tested advice here on this issue if anyone has already faced such issue
Regards,


Ahmed

Accepted Solutions (0)

Answers (2)

Answers (2)

sonet_kebede
Advisor
Advisor
0 Kudos

Hello,

If the data source is Hadoop than you should use Apache Simba JDBC \ODBC. Please check our supported platform.

I hope this could help.

Thanks,

Sonet

sonet_kebede
Advisor
Advisor
0 Kudos

Hello,

What's the version of BO you are using?

What's the version of Hadoop (Hive1 or Hive2)?

What is the reason you are using Cloudera Simba ODBC\JDBC vs Apache Simba JDBC \ODBC

If you have Hadoop than you should use Apache Simba JDBC \ODBC

Thanks,

Sonet

0 Kudos

Bo version is 4.2. I am not sure about Hadoop Version I use the HUE editor web interface to run queries and the HUE version in 3.10. We are using Impala in hadoop so using Cloudera Impala 2.0 - Simba JDBC Drivers as connection.