Hi,
Wishing you a very happy and prosperous New Year!
I am trying to execute a query in Hive using the pyhs2 library. My objective is to capture the results of the Hive query in a Pandas dataframe. My pyhs2 code fails once the result set grows beyond a certain size (a particular combination of rows and columns).
In my Hive query, I am fetching 145 columns and around 4.5 million rows. My code fails in the following circumstances:
a) When I fetch more than 52 rows with all 145 columns, the code fails.
b) If I reduce the number of columns to 4, I can fetch up to a million rows, but beyond that it fails again.
Could you please help me resolve this error with a possible solution? Thanks in advance!
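Since the failure seems to depend on how much data is pulled at once, one thing that might be worth trying is fetching the result in fixed-size batches instead of all rows in one call. Below is only a minimal sketch of that idea, not a confirmed fix. It assumes the cursor exposes a DB-API style `fetchmany(size)` method; I have not confirmed that the pyhs2 cursor does. The helper name `fetch_in_batches`, the batch size, and the table are hypothetical, and `sqlite3` is used purely as a stand-in so the example runs without a Hive server.

```python
import sqlite3


def fetch_in_batches(cursor, batch_size=10000):
    """Yield lists of rows from a DB-API style cursor, batch_size rows at a time.

    Assumption: the cursor implements fetchmany(size) as described in PEP 249.
    """
    while True:
        rows = cursor.fetchmany(batch_size)
        if not rows:
            # An empty result means the query has no more rows.
            break
        yield rows


# Stand-in for the Hive connection: an in-memory SQLite table with 25 rows.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE t (a INTEGER, b TEXT)")
cur.executemany("INSERT INTO t VALUES (?, ?)", [(i, str(i)) for i in range(25)])

cur.execute("SELECT a, b FROM t")
batch_sizes = [len(batch) for batch in fetch_in_batches(cur, batch_size=10)]
print(batch_sizes)  # [10, 10, 5]
```

Each batch could then be turned into its own `pandas.DataFrame` and the pieces combined with `pandas.concat`, so that no single fetch has to carry all 4.5 million rows.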
Cheers