Hi Guys, I am very new to Pentaho and I am thinking about using Pentaho in combination with Hive as an alternative to R.
I want to do the following: From Pentaho I want to submit a Hive query on a Hadoop Cluster where it is executed. After that I want to analyse the results from this query by computing the correlations between two columns which are part of the result set and I want to visualize them. I allready managed to submit a Hive query from the Pentaho Report Designer but I was not able to view the Result from Pentaho or even to visualize them. So my question is first: Is it possible to use Pentaho for this described use case? And the second question: Which of the Pentaho solutions should I download? It seems that the Report Designer is not able to analyse the result from hive. So is it the Pentaho data integration? Or Pentaho Big Data?
Help would be really apreciated.
I want to do the following: From Pentaho I want to submit a Hive query on a Hadoop Cluster where it is executed. After that I want to analyse the results from this query by computing the correlations between two columns which are part of the result set and I want to visualize them. I allready managed to submit a Hive query from the Pentaho Report Designer but I was not able to view the Result from Pentaho or even to visualize them. So my question is first: Is it possible to use Pentaho for this described use case? And the second question: Which of the Pentaho solutions should I download? It seems that the Report Designer is not able to analyse the result from hive. So is it the Pentaho data integration? Or Pentaho Big Data?
Help would be really apreciated.