Skip to content

Unable to successfully run pyhs2 code beyond a certain set of rows and columns #66

@MyPythonGitHub

Description

@MyPythonGitHub

Hi,
Wish you a very Happy and Prosperous New Year !

     I am trying to execute a query in Hive using pyhs2 library. My objective is to capture the results of the hive query into a Pandas dataframe. I am experiencing an error in my pyhs2 code beyond a limit (a specific set of rows and columns).

      In my Hive query, I am fetching 145 columns and around 4.5 million rows. My code is failing in the following circumstances:

a) When I fetch more than 52 rows and all the 145 columns, the code fails

b) If I reduced the column to 4 then , I can fetch upto a million rows but beyond that it's failing again.

        Will you please help me to resolve the error with a possible solution. Thanks in advance !

Cheers

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions