Quantcast
Channel: Pentaho Community Forums
Viewing all 16689 articles
Browse latest View live

“All” grouping includes filtered measurement values

$
0
0
I have an MDX statement that filters a dataset based on Measurement_1 (Measurement_1 >= 70) across three dimensions (via crossjoin). The results per value of each Dimension_0, Dimension_1 and Dimension_2 combination is correct and the filter is working as expected. The MDX statement is executed against Kylin using Mondrian.
The problem arises in the "All" part of the resultset. If the values of Measurement_0 (not the filtered measurement) is summed together and compared to the "All" part of the result set, there is a difference in value.
Below is the formatted result set:

Code:

{
  "report": [
    {
      "All Dimension_0": [
        {
          "XYZ": [
            {
              "ABC": {
                "[Measures].[Measurement_1]": 75.08282389023466,
                "[Measures].[Measurement_0]": 191220044878
              }
            }
          ]
        }
      ]
    },
    {
      "Dimension_0 value 0": [
        {
          "XYZ": [
            {
              "ABC": {
                "[Measures].[Measurement_1]": 74.039688653933,
                "[Measures].[Measurement_0]": 9031912913
              }
            }
          ]
        }
      ]
    },
    {
      "Dimension_0 value 1": [
        {
          "XYZ": [
            {
              "ABC": {
                "[Measures].[Measurement_1]": 75.15194214129325,
                "[Measures].[Measurement_0]": 175519866354
              }
            }
          ]
        }
      ]
    },
    {
      "Dimension_0 value 2": [
        {
          "XYZ": [
            {
              "ABC": {
                "[Measures].[Measurement_1]": 75.42622332399225,
                "[Measures].[Measurement_0]": 6246658572
              }
            }
          ]
        }
      ]
    }
  ]
}




The value of Measurement_0 on "All Dimension_0" grouping equals 191,220,044,878, but the sum of Measurement_0 on the subsequent groupings equals 190,798,437,839. Which is a difference of 421,607,039.
On removing the filter from the MDX statement, an extra grouping appears in the result set (as expected), with a Measurement_1 value outside the filter (specifically equalling null). The value of Measurement_0 of this grouping is equal to the difference between the "All" grouping and the manual summing of the groupings shown above.
Below is the additional grouping that appears on the removal of the filter:

Code:

{
  "Dimension_0 value 3": [
    {
      "XYZ": [
        {
          "ABC": {
            "[Measures].[Measurement_1]": null,
            "[Measures].[Measurement_0]": 421607039
          }
        }
      ]
    }
  ]
}



Below is the MDX statement with the filter:

Code:

SELECT {Measurement_0, Measurement_1} ON AXIS(0),
        {
            FILTER(
                    CROSSJOIN(
                        CROSSJOIN(
                                    {[Dimension_0].Members}, {[Dimension_1].XYZ}
                                ), {[Dimension_2].ABC}
                        ), (((Measurement_1 >= 70))))} ON AXIS(1)
FROM <table>



Is there a way to exclude the value of Measurement_0 from the "All" calculation as the grouping is not returned as part of the filtered resultset?

I have tried to use VisualTotals after the filter has been applied, but it seems VisualTotals works with members of the same hierarchy, where the FILTER returns tuples.

Any advice or direction appreciated.

Pieter



How to generate dynamic reports based on user's column specific access permission?

$
0
0
I am fairly new to Pentaho and wondering if there is a way to achieve my requirement.

I want to create a dynamic report using Pentaho Report Editor which can be accessed by users based on their column/variable specific access permission.

Consider there is a table with 5 columns - C1, C2, C3, C4, C5 and a single Pentaho report which displays all these columns in a tabular format.
I have three users U1, U2 and U3. I want user U1 to generate the report where he can see only columns C1, C2 and C3, the user U2 should see the columns C3, C4 and C5 and the user U3 should see all the columns.

I assume this can be done with the help of Pentaho Metadata Editor. Please confirm if it is possible and help me with the solution.

Appreciate your help. :)

Failed at query: SELECT

$
0
0
I have published a report and when I run it from the Pentaho User Console all i get is a message: 'Failed query at: SELECT'

change login id , password in kettle

$
0
0
hi
I am Afroz .I want to change kettle login id ,password
please tell me the options in kettle

Dynamic mapping of input fields to output fields

$
0
0
hi,

I have input file which has Fields like "Name","Number""Address". Which are supposed to be named as "Name_2016","Number","Adress_2016" respectively. But in future my input fields names and output field names may change. For This there any way to do mapping so tat I need not to open my KTR file and modify.Thee should be some common location where I modify once and all the KTR

Thanks in advance,
Hemavathi

Will "Get Rows From Result" work in Azure webjob in cloud?

$
0
0
Hello,
We have recently migrate the kettle development code to MS Azure and created a webjob to run a kettle job, which has "Copy Rows to Result" in 1st tranformation and " Get rows from result" in the 2nd transformation...
This is working fine in local machine but when running the webjob in azure "Get rows from rows" is giving zero records...

Can you pls help me? how to resolve this error?

Thanks!!

PDI Logging

$
0
0
Hello,

I have various number of Jobs containing several number of Transformations who are running through a scheduler using bat file. I want to generate a single Log table which will contain the Job name,name of its transformations and its steps,Start Date Time and End Date Time.

I have already checked the Job and transformation Logging settings, but it doesn't provide consolidated information as mentioned in above format.

Thanks in advance

Is SYSTEM tablespace access required when configuring Pentaho with Oracle repository

$
0
0
We are deploying Pentaho 6.1 BA platform to our UAT labs using Oracle repository.
On reviewing the scripts to create users (create_jcr_ora.sql etc.), the deployers have an issue granting SYSTEM tablespace access to users, for e.g.,
create user <jcr, quartz, pentaho_operations_mart..> identified by "password" default tablespace pentaho_tablespace quota unlimited on pentaho_tablespace temporary tablespace temp quota 5M on system;


- Why does the temporary tablespace need a quota of 5M on the SYSTEM tablespace?
- Would the default temp tablespace work just fine as an alternative?

Set HTTP header properties for dynamic image URL?

$
0
0
I’m using the Pentaho classic engine to create reports with dynamic images through URLs. At the URL endpoints, it requires a security token that has to be set at the HTTP header for access. Is there any way I can set this HTTP header property somewhere in the Pentaho classic engine, every time when the engine tries to load the images at those URLs?

Thanks for any suggestions.

How to change column name using parameter?

$
0
0
Hello everyone,
I have a query that gets the parameters (five) from a webpage. I want my column header name as the parameter value.
I can easily do that by replace field name if a user selects all, but according to my requirement I allow user to select only two at a time.

I have searched through kettle transformation steps, but I didn't find any kettle step to do that.
Is there any way to do that using javascript?
If yes, Please give me a direction.
(I want to use that kettle for chart)

Let's say A, B, C, D, E there are five parameter names.
If the user selects A, B then column name should be
A B
and if the user selects A, D then column name should be
A D


Thanks

No prompt for authentication when using direct link to Dashboard

$
0
0
We gave a client link so he can see dashboard and it used to work fine.
In sense that if client was not logged in, pentaho used to ask user to login.. At the moment this is not happening. Client is getting 500 error and in pentaho.log we have this error:

ERROR [org.pentaho.platform.web.http.api.resources.RepositoryResource] Repository file [: public: Dashboards:MyDashboard.wcdf] not found

Do you know why is this happening?
I should mention that we enable URL login (?userid=user&password=password) but we think issue started before we enabled this feature.

Thank you in advance

Shell Script step: Does it exist anymore?

$
0
0
Hi,
As part of my transformations I would like to run a shell command: myCommand.sh
Looking around I see examples of folks using a step called Shell Script, but in my PDI version (6.1 ce) I don't see this step anywhere. Is it deprecated/gone now?

As an alternative I'm feeing an Execute a process step with a data grid that contains the command I want to run. But I want to also capture or display the output of the command and I don't see the way to do this.

Thanks for your help and advice

Selecting Applying We are having problems when applying the "NumericToNominal" filter

$
0
0
We are having problems when applying the "NumericToNominal" filter to convert our dependent variable. When we apply it by checking only the depdendent variable, it not only converts the dependent variable but it also converts the rest of the independent variables also.



How do we selectively convert only those variables to Nominal which are categorical?

Thank you for your guidance in advance!

Google Analytics Step in azure webjob

$
0
0
Hi All,

We are running Azure webjob and it has Google Analytics Input step . It is not initiating through webjob . Do we need to do anything to make it run in webjob ?

This is perfectly working fine in local windows machine. Highly appreciate for your help!!

Thanks!!

Excel with multiple sheets Input

$
0
0
Hi All,

I have a excel file with multiple sheets (size is large though).
I need to export all the data from multiple sheets to a single database table.
I am using Excel 2007 XLSX (Apache POI Streaming) for the large set of data.

But i am unable to preview the records, it says java heap space issue.

if "%PENTAHO_DI_JAVA_OPTIONS%"=="" set PENTAHO_DI_JAVA_OPTIONS="-Xmx512m" "-XX:MaxPermSize=256m"

this is what i got in my spoon.bat

Please help how to achieve this

Java based data export Library in PDI

$
0
0
I am looking for a Java based library to embed into my Spring based web application to enable exporting of data in SQL from a MySQL database via JDBC into file that can be downloaded.There will be multiple deployments of this which I do not control so cannot be certain that the command line tools will be available in the path, embedding them into the WAR file increases its size by over 50MB since I have to support 32/64 bit Windows and Linux. Seeing that PDI has similar functionality as part of its featureset I am wondering whether I can leverage the libraries used to achieve this. Thanks in advance

Issue with Invoking Loop{For Each Row Entry} from Azure Web Job

$
0
0
First of all I am new to Kettle PDI and Pentaho in general.
Can someone guide me on the below issue I am facing
Received below error message when I submit the Kettle Job via Azure Web Job
Below is the snapshot of the error message.

[10/20/2016 08:38:39 > 5a0168: INFO] 2016/10/20 08:38:39 - GetGAViewIds - Dispatching started for transformation [GetGAViewIds]
[10/20/2016 08:38:41 > 5a0168: INFO] 2016/10/20 08:38:41 - Table input.0 - Finished reading query, closing connection.
[10/20/2016 08:38:41 > 5a0168: INFO] 2016/10/20 08:38:41 - Copy rows to result.0 - Finished processing (I=0, O=0, R=5, W=5, U=0, E=0)
[10/20/2016 08:38:41 > 5a0168: INFO] 2016/10/20 08:38:41 - Table input.0 - Finished processing (I=5, O=0, R=0, W=5, U=0, E=0)
[10/20/2016 08:38:41 > 5a0168: INFO] 2016/10/20 08:38:41 - GetGAData - Starting entry [LoadGAdata]
[10/20/2016 08:38:41 > 5a0168: INFO] 2016/10/20 08:38:41 - LoadGAdata - Loading transformation from XML file [file:///D:/home/site/wwwroot/data/job_daily/InputSplit.ktr]
[10/20/2016 08:38:41 > 5a0168: INFO] 2016/10/20 08:38:41 - InputSplit - Dispatching started for transformation [InputSplit]
[10/20/2016 08:38:42 > 5a0168: ERR ] 2016/10/20 08:38:42 - GA_Search_pull.0 - ERROR (version 5.4.0.1-130, build 1 from 2015-06-14_12-34-55 by buildguy) : Error initializing step [GA_Search_pull]
[10/20/2016 08:38:42 > 5a0168: ERR ] 2016/10/20 08:38:42 - InputSplit - ERROR (version 5.4.0.1-130, build 1 from 2015-06-14_12-34-55 by buildguy) : Step [GA_Search_pull.0] failed to initialize!
[10/20/2016 08:38:42 > 5a0168: INFO] 2016/10/20 08:38:42 - GetGAData - Finished job entry [LoadGAdata] (result=[false])
[10/20/2016 08:38:42 > 5a0168: ERR ] 2016/10/20 08:38:42 - LoadGAdata - ERROR (version 5.4.0.1-130, build 1 from 2015-06-14_12-34-55 by buildguy) : Unable to prepare for execution of the transformation
[10/20/2016 08:38:42 > 5a0168: INFO] 2016/10/20 08:38:42 - GetGAData - Finished job entry [GetGAViewIds] (result=[false])
[10/20/2016 08:38:42 > 5a0168: ERR ] 2016/10/20 08:38:42 - LoadGAdata - ERROR (version 5.4.0.1-130, build 1 from 2015-06-14_12-34-55 by buildguy) : org.pentaho.di.core.exception.KettleException:
[10/20/2016 08:38:42 > 5a0168: INFO] 2016/10/20 08:38:42 - GetGAData - Finished job entry [Delete files] (result=[false])

Eliminating values in one excel sheet based on content in other Excel sheet

$
0
0
Hi All,

I have a excel sheet where i need to exclude records based on content of another excel sheet.

Just for example lets say Excel sheet 1 has column called name

Name
-------------
ABC
PQR
STU
VCX

And excel sheet 2 has column called name as well

Name
--------
ABC

So this name will be excluded in Excel sheet 1 and modified excel sheet will look like this

Name
-------------
PQR
STU
VCX


Thanks in Advance

Launching a report from a standalone java application

$
0
0
Hi.

I have generated a report from Pentaho RD. When released the report from RD everything appears correctly.
The problem arises when I call this report from a standalone java application. In this case overprinted in the report fields used by the main Query. (I put a sample image).

Camposresaltados.PNG

What I can do to avoid these overprinted fields appear?
I just want to leave the fields and texts which I define in the report. Like the next image:

SinCampos.PNG

Thanks in advance
Attached Images

Duplicate names merged but not summarized (schema)

$
0
0
A dimension table "DIM_B" is dependant of "DIM_A". Multiple rows in DIM_B has the same name in column "NAME".

The problem is Mondrian merges all rows with duplicate names.

Example of dimensions:

<Dimension foreignKey="DIM_A" name="A" type="StandardDimension">
<Hierarchy primaryKey="ID" name="A" allLevelName="(ALL)" hasAll="true">
<Table name="DIM_A"/>
<Level name="Name" column="ID" nameColumn="NAME" ordinalColumn="NAME" type="Integer" uniqueMembers="true" levelType="Regular" hideMemberIf="Never"/>
</Hierarchy>
</Dimension>

<Dimension foreignKey="DIM_B" name="Operations" type="StandardDimension">
<Hierarchy primaryKey="ID" name="Operation" allLevelName="(ALL)" hasAll="true">
<Table name="DIM_B"/>
<Level name="Name" column="ID" nameColumn="NAME" ordinalColumn="NAME" type="Integer" uniqueMembers="true" levelType="Regular" hideMemberIf="Never"/>
</Hierarchy>
</Dimension>



So two rows can have same name like this:

ID = 11, A = 1, B = 1, Name = "Name1", Value = 1
ID = 12, A = 2, B = 1, Name = "Name1", Value = 1

Would be displayed like this:

Name1, Value 1

But how can the dimensions be configured to show both of these rows? Alternatively, how to show them merged but also summarized values?
Viewing all 16689 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>