Saiku 3.0 unable to cancel running query

April 10, 2014, 6:38 am


Cancelling a running query does not work in Saiku 3.0.

The request

DELETE http://url/saiku/rest/saiku/api/quer...9C7D7E9/result

works fine in Saiku 2.6, where we get the following exception:

"QueryCanceledException: Mondrian Error:Query canceled"

In Saiku 3.0, however, the same request returns a 404 from the server.

↧

AddAttribute file help

April 10, 2014, 7:17 am


Hi, I'm trying to work with WEKA using jAudio, which outputs audio features to a .arff file. The tutorial at http://www.cs.cmu.edu/~music/cmsip/projects/p6.pdf says I need to use the AddAttribute.java file to add a genre attribute, but I can't get it working. I'm pretty new to Java, so I'm not entirely sure how to change the code for my needs.

I've saved the AddAttribute.java file alongside weka.jar and the folder that holds the .arff files. I tried both changing the file name in the code to my .arff file's name and renaming my file to 'file.arff' as in the AddAttribute code, and then opening the .arff file, but with no luck.
WEKA still opens the file fine, but only with the attributes already in the .arff; the new one is not added.
The code for AddAttribute is shown at http://weka.wikispaces.com/Adding+at...s+to+a+dataset. Any help would be greatly appreciated, thanks.

Also, the values of the attributes already present are numerical, but the new class attribute should be nominal so that I can use a naive Bayes classifier or clustering, I think.

↧

PDI Normalize row scenario

April 10, 2014, 7:41 am


I have the following data set as the output from a table query:

Codes Value
10010 TestName1
10020 Type1
10030 Value1
10110 TestName2
10120 Type2
10130 Value2

In the code, the first 3 characters identify the row to normalize to, and the last 2 characters identify which value the row holds. These 6 rows should become 2 rows of the form:

Code Name Type Value
100 TestName1 Type1 Value1
101 TestName2 Type2 Value2

How can this be completed using standard Pentaho PDI steps?

Thanks for any help.
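
The transformation described above can be sketched in plain Java to make the intended logic explicit. This is only an illustration, not a PDI step configuration: the class name, the method names, and the mapping of the suffixes 10/20/30 to Name/Type/Value are assumptions read off the sample data. In PDI itself, one approach that may work is to split the code into a 3-character key and a 2-character suffix (for example with a Strings cut step) and then feed the sorted rows to a Row denormaliser step grouped on the key.

```java
import java.util.LinkedHashMap;
import java.util.Map;

class NormalizeRowsSketch {

    // Assumed mapping from the last two characters of the code to a target column.
    static String columnFor(String suffix) {
        switch (suffix) {
            case "10": return "Name";
            case "20": return "Type";
            case "30": return "Value";
            default: throw new IllegalArgumentException("Unknown suffix: " + suffix);
        }
    }

    // Groups rows by the first three characters of the code and
    // turns the remaining two characters into column names.
    static Map<String, Map<String, String>> pivot(String[][] rows) {
        Map<String, Map<String, String>> result = new LinkedHashMap<>();
        for (String[] row : rows) {
            String key = row[0].substring(0, 3);
            String suffix = row[0].substring(3);
            result.computeIfAbsent(key, k -> new LinkedHashMap<>())
                  .put(columnFor(suffix), row[1]);
        }
        return result;
    }

    public static void main(String[] args) {
        String[][] rows = {
            {"10010", "TestName1"}, {"10020", "Type1"}, {"10030", "Value1"},
            {"10110", "TestName2"}, {"10120", "Type2"}, {"10130", "Value2"}
        };
        // Prints one line per 3-character code: code, name, type, value.
        pivot(rows).forEach((code, cols) -> System.out.println(
            code + " " + cols.get("Name") + " " + cols.get("Type") + " " + cols.get("Value")));
    }
}
```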

↧

Histograms

April 10, 2014, 7:49 am


I am trying to write an MDX query to generate a histogram. The histogram buckets are based on a sequence and a dollar amount: each bucket holds a count of how many sequences fall between two dollar amounts. We apply filters to the data, so a sequence can end up in a different bucket depending on the filters the application applies. We attempted to use the Filter function to build the buckets, but that results in Mondrian generating IN clauses that are too big for the database to handle. We also attempted to create a UDF to replace the filter, but we ended up with the same problem.

If the buckets were based on time instead of dollar amounts, or if the sequences could be fixed to a bucket no matter what filters are applied, this would work. Has anyone had any success doing this kind of histogram with Mondrian?
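
To make the bucket definition concrete, here is a minimal sketch of the counting itself, done outside MDX. It is not a Mondrian solution; it only assumes that one dollar amount per sequence is already available after the filters have been applied. The class name, method name, and half-open bucket boundaries are assumptions.

```java
class HistogramSketch {

    // Counts amounts per bucket; bucket i covers [edges[i], edges[i + 1]).
    // Amounts outside the outermost edges are ignored.
    static int[] countByBucket(double[] amounts, double[] edges) {
        int[] counts = new int[edges.length - 1];
        for (double amount : amounts) {
            for (int i = 0; i < counts.length; i++) {
                if (amount >= edges[i] && amount < edges[i + 1]) {
                    counts[i]++;
                    break;
                }
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        double[] amounts = {5, 15, 25, 99.99, 100, 150};
        double[] edges = {0, 10, 100, 1000};
        for (int count : countByBucket(amounts, edges)) {
            System.out.println(count);
        }
    }
}
```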

↧

Error Sucks !! org.pentaho.di.trans.steps.userdefinedjavaclass.FieldHelper.setValue(F

April 10, 2014, 7:50 am


Hi,

I am getting the following exception when setting the output fields in UDJC.

2014/04/10 20:15:07 - Get SDP Range Value.0 - ERROR (version 4.4.0-stable, build 17588 from 2012-11-21 16.02.21 by buildguy) : 11
2014/04/10 20:15:07 - Get SDP Range Value.0 - ERROR (version 4.4.0-stable, build 17588 from 2012-11-21 16.02.21 by buildguy) : java.lang.ArrayIndexOutOfBoundsException: 11
2014/04/10 20:15:07 - Get SDP Range Value.0 - ERROR (version 4.4.0-stable, build 17588 from 2012-11-21 16.02.21 by buildguy) : at org.pentaho.di.trans.steps.userdefinedjavaclass.FieldHelper.setValue(FieldHelper.java:92)
2014/04/10 20:15:07 - Get SDP Range Value.0 - ERROR (version 4.4.0-stable, build 17588 from 2012-11-21 16.02.21 by buildguy) : at Processor.processRow(Processor.java:75)
2014/04/10 20:15:07 - Get SDP Range Value.0 - ERROR (version 4.4.0-stable, build 17588 from 2012-11-21 16.02.21 by buildguy) : at org.pentaho.di.trans.steps.userdefinedjavaclass.UserDefinedJavaClass.processRow(UserDefinedJavaClass.java:1181)
2014/04/10 20:15:07 - Get SDP Range Value.0 - ERROR (version 4.4.0-stable, build 17588 from 2012-11-21 16.02.21 by buildguy) : at org.pentaho.di.trans.step.RunThread.run(RunThread.java:50)
2014/04/10 20:15:07 - Get SDP Range Value.0 - ERROR (version 4.4.0-stable, build 17588 from 2012-11-21 16.02.21 by buildguy) : at java.lang.Thread.run(Thread.java:662)

Please have a look at the image.

If I remove the below line from UDJC, it works,

get(Fields.Out, "DA").setValue(r, "1");

Please advise.
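
One possible cause, which the log alone does not confirm: the row array r may still have the size of the input row, so writing the extra output field at index 11 falls outside the array. The default UDJC template resizes the row first with createOutputRow(r, data.outputRowMeta.size()) before calling setValue. The plain-Java sketch below only illustrates that effect with ordinary arrays; setValue and resizeRow here are hypothetical stand-ins, not the Kettle API.

```java
import java.util.Arrays;

class RowResizeSketch {

    // Hypothetical stand-in for a field setter that writes into a fixed slot of the row.
    static void setValue(Object[] row, int index, Object value) {
        row[index] = value; // throws ArrayIndexOutOfBoundsException if the row is too short
    }

    // Returns the row itself if it is long enough, otherwise a longer copy.
    static Object[] resizeRow(Object[] row, int outputSize) {
        return row.length >= outputSize ? row : Arrays.copyOf(row, outputSize);
    }

    public static void main(String[] args) {
        Object[] inputRow = new Object[11];            // valid indexes are 0..10
        Object[] outputRow = resizeRow(inputRow, 12);  // makes room for index 11
        setValue(outputRow, 11, "1");
        System.out.println(outputRow[11]);
    }
}
```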

Attached image: udjc_image.jpg (38.7 KB)

↧

java.lang.OutOfMemoryError for Apriori association on pretty small dataset

April 10, 2014, 9:39 am


Hello,

I'm trying to run the Apriori association in weka-3-7-10-oracle-jvm on OS X. I've tried to turn up the memory allocation by running it from command line:

$ java -Xmx4096M -jar weka.jar

The data set is pretty small:

=== Run information ===

Scheme: weka.associations.Apriori -N 10 -T 0 -C 0.9 -D 0.05 -U 1.0 -M 0.1 -S -1.0 -c -1
Relation: orderitems_denormalized-weka.filters.unsupervised.instance.Denormalize-Gfirst-ASum-S-weka.filters.unsupervised.attribute.Remove-R1
Instances: 2556
Attributes: 115

But I'm getting the error:
Exception in thread "Thread-12" java.lang.OutOfMemoryError: Java heap space
weka.associations.AprioriItemSet.mergeAllItemSets(AprioriItemSet.java:542)
weka.associations.Apriori.findLargeItemSets(Apriori.java:1673)
weka.associations.Apriori.buildAssociations(Apriori.java:519)
weka.gui.explorer.AssociationsPanel$7.run(AssociationsPanel.java:406)

Is this as expected? What can/should I do?
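
A rough way to see why a data set of this size can still exhaust the heap: the work in Apriori grows with the number of candidate itemsets rather than with the number of rows. The sketch below only computes the upper bound C(n, k) on the number of k-itemsets, treating each of the 115 attributes as a single item, which is a simplifying assumption (the real count depends on the attribute values and on the minimum support). If most items turn out to be frequent at the 0.1 lower bound on minimum support (-M), raising that bound may be worth trying.

```java
class ItemsetCountSketch {

    // Binomial coefficient C(n, k), computed so every intermediate value is an integer.
    static long choose(int n, int k) {
        long result = 1;
        for (int i = 1; i <= k; i++) {
            result = result * (n - k + i) / i;
        }
        return result;
    }

    public static void main(String[] args) {
        int items = 115; // assumed: one item per attribute
        for (int k = 2; k <= 5; k++) {
            System.out.println(k + "-itemsets, upper bound: " + choose(items, k));
        }
    }
}
```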

↧

XML read disables Prune path

April 10, 2014, 11:15 am


I'm reading very large XML files using the Get XML Data step. I would like to use the "Prune path to handle large files" option, which improves performance drastically.
However, if I accept the files from a previous step (by checking "XML source is defined in a field?"), Spoon disables the Prune path option. Any suggestions?

Version: PDI 5.0.1
OS: ubuntu 12.04 LTS

↧

[Microsoft][ODBC Driver Manager] Invalid string or buffer length

April 10, 2014, 11:39 am


Hey, I'm hoping someone can help out here, as I'm pretty exhausted trying to find a solution. Any help is greatly appreciated!

I'm trying to extract some data from Access to SQL Server. I've defined 8 different ODBC connections on the server, and this job runs for each. It knows how to swap them out, so assume that all works. I'm getting the following error on *some* tables, but not all:

Code:



ERROR 09-04 23:26:23,340 - MS Access <TableName> - org.pentaho.di.core.exception.KettleDatabaseException:
Couldn't get row from result set

Unable to get value 'String(7)' from database resultset, index 0
[Microsoft][ODBC Driver Manager] Invalid string or buffer length

    at org.pentaho.di.core.database.Database.getRow(Database.java:2758)
    at org.pentaho.di.core.database.Database.getRow(Database.java:2730)
    at org.pentaho.di.trans.steps.tableinput.TableInput.processRow(TableInput.java:153)
    at org.pentaho.di.trans.step.RunThread.run(RunThread.java:50)
    at java.lang.Thread.run(Unknown Source)
Caused by: org.pentaho.di.core.exception.KettleDatabaseException:

I'm hesitant to point to any environment issues, because when I run this through Spoon, it works *perfectly* every time.

Can anyone think of what is different between Spoon and Kitchen?

Here's the command line from kitchen.bat that's getting run:

"C:\Program Files\Java\jre7\bin\java.exe" -Xmx512m "-Djava.library.path=libswt\win64" "-DKETTLE_HOME=D:\KettleHome" "-DKETTLE_REPOSITORY=" "-DKETTLE_USER=" "-DKETTLE_PASSWORD=" "-DKETTLE_PLUGIN_PACKAGES=" "-DKETTLE_LOG_SIZE_LIMIT=" -jar "C:\Kettle\pdi-ce-4.4.0-stable\data-integration\launcher\launcher.jar" -lib ..\libswt\win64 -main org.pentaho.di.kitchen.Kitchen /rep:ValidRepName /job:"JobName"

And some software information:
-OS Version: Win Server 2008 R2 64-bit
-PDI Version: 4.4 Stable CE
-JRE Version: 7 build 51

↧

CCC - FAQ

April 10, 2014, 11:42 am


This thread post is an index to frequent questions and selected answers about CCC.
It will be updated over time.

FAQ
----

Documentation?
1. Site contains live examples, summary of options and extension points for each chart type
2. Options reference documentation
3. Protovis reference documentation
  - watch out for the following differences: pvc.Dot#{angle, radius, size} should be renamed to pv.Dot#{shapeAngle, shapeRadius, shapeSize}
4. Upgrading to v2: A
5. Fiddles: A
Environment
1. «Where should I put that piece of code?»: A, B, C, D, E
2. Using CCC in multiple environments: A
3. Debugging: A
4. V1 compatibility mode: A
Object model
1. Extension point: A, B
2. Visual API: A
3. «Something like "this.scene.vars.series" means absolutely nothing to me»: A, B
Interaction
1. Handling the click action: A, B
2. Handling the selectionChanged action: A, B
3. Fast resize-re-render: A
4. First render with some datums (~rows) already selected: A
5. First render (sort of) with some series hidden: A
Tooltip
1. Default format for a value: A, B
2. Hiding a dimension from the user: A
Data source translation
1. Relational vs Crosstab formats: A
2. What's the expected data format? .....
3. Virtual item structure: A, B, C, D
4. Many ways to specify "readers": A, B
5. Sort by value and TOP N: A
6. Multiple charts, one data source: A
Data, Dimensions
1. Explicitly defining dimensions: A
2. Changing default labels of dimensions: A
3. Calculations; "dataPart" dimension and visual role; custom partitioning of data between the main plot and plot2: A
4. Data aggregation in categorical charts: A, B
Visual Roles, Axes
1. Reverse the values of visual roles (category, series, color, ...): A, B
2. Fixed colors for some series values: A, B
3. Second plot with a second Y axis: A
4. Generating integer ticks: A
5. Hiding cartesian axes' rule: A
6. Remove the frame/rectangle around the plot: A
Charts
1. Bar
  1. Styling bars: A
Walkthroughs
1. "Paired Bar-Line Measures" site example explained: A
2. "Simulating" a statistical dot plot: A

↧

PDI config questions regarding spoon and remote access

April 10, 2014, 1:23 pm


I have a remote server running Ubuntu 12.04. Pentaho BI Server CE 5.0.1 is installed and I can log in as admin. Now for data integration: the tutorials say to just unzip and run the spoon.sh script, but I get "missing file" errors (mostly Java SWT). Now I'm wondering whether Spoon should be on the server or on my desktop. It sounds like it can't be accessed through a Tomcat URL like the BI server. I thought PDI would be accessed through Tomcat in my browser, and that I would extract, transform, and load (do my ETL) in a way similar to connecting to databases via phpMyAdmin.

Please help me clear up my misunderstanding.

Thank you

↧

CCC FAQ: Frequently Asked Questions About CCC

April 10, 2014, 11:42 am


This thread post is an index to frequent questions and selected answers about CCC.
It will be updated over time.

FAQ
----

Documentation?
1. Site contains live examples, summary of options and extension points for each chart type
2. Options reference documentation
3. Protovis reference documentation
  - watch out for the following differences: pvc.Dot#{angle, radius, size} should be renamed to pv.Dot#{shapeAngle, shapeRadius, shapeSize}
4. Upgrading to V2: A
5. Fiddles: A
6. V1 extension point reference: A
Environment
1. «Where should I put that piece of code?»: A, B, C, D, E, F
2. Using CCC in multiple environments: A, B
3. Debugging: A
  1. For debugging layout issues, specify a debugLevel of 16 or greater.
4. V1 compatibility mode: A
Object model
1. Extension points: A, B, C
2. Visual API: A
3. «Something like "this.scene.vars.series" means absolutely nothing to me»: A, B, C
4. atoms vs vars: A
Interaction
1. Handling the click action: A, B, C, D
2. Handling the selectionChanged action: A, B, C, D
3. Handling right-click and other protovis mark-level events: A
4. Fast resize-re-render: A, B
5. In memory data-reload and re-render: A
6. First render with some datums (~rows) already selected: A
7. First render (sort of) with some series hidden: A
Tooltip
1. See also "Data, Dimensions" for label, default value format and hidden dimensions.
2. Automatic data summary tooltips: A
3. Types of automatic tooltips on cartesian axis: A
4. Showing extra non-visually represented dimensions: A
5. Custom, showing column and row percentage: A, B
Data source translation
1. Relational vs Crosstab formats: A
2. What's the expected data format? .....
3. Virtual item structure: A, B, C, D
4. Many ways to specify "readers": A, B
5. CDE postFetch transformations and statistics:
  1. Sort by value and TOP N: A
  2. Multiple charts, one data source: A
  3. Filtering rows: A
  4. Calculating range of column: A
  5. Counting categories or series: A
Data, Dimensions
1. Explicitly defining dimensions: A
2. Changing default labels of dimensions: A
3. Default format for a value: A, B
4. Changing sort order of a dimension: A
5. Hiding a dimension from the user: A
6. Calculations; "dataPart" dimension and visual role; custom partitioning of data between the main plot and plot2: A
7. Data aggregation in categorical charts: A, B, C
8. Specifying trending options: A
Visual Roles, Axes
1. Reverse the values of a visual role (category, series, color, ...): A, B
2. Color
  1. Fixed colors for some series values: A, B
  2. Gradient colors: A, B
  3. Changing default color scheme: A
3. Cartesian
  1. Second plot with a second Y axis: A
  2. Y axis with a log scale: A
  3. Generating integer ticks: A
  4. Date ticks formatting: A, B, C
  5. Hiding cartesian axis' rule: A
  6. Centering discrete axis grid-lines: A
  7. Composite/Hierarchical axis (like that of the HeatGrid chart): A, B, C
  8. Controlling tick label overlapping: A, B
  9. Summary of scale and tick affecting options: A, B
4. Remove the frame/rectangle around the plot: A
Value labels
1. Changing text: A, B
2. Automatic color depending on background: A, B, C
3. Hiding or trimming when inside and don't fit: A, B
Charts
1. Bar
  1. Styling bars: A
  2. Changing bar colors dynamically: A, B, C
  3. 3-D style: see "Axes > Gradient Colors"
  4. Simultaneously Stacked & Grouped: A
2. Line
  1. Disguising interpolated segments: A
3. Bullet ....
4. CDE Dial ....
5. CDE Protovis ...
Walkthroughs
1. "Paired Bar-Line Measures" site example explained: A
2. "Simulating" a statistical dot plot: A
3. "Stacked bars on a composite axis": A

↧

upload files to ftps creating connection timeout

April 10, 2014, 9:39 pm


Hi all,
For the last few days I have been having problems with an FTP server. I have created a job that has to upload a CSV file to the FTP server, so I am using the "Upload files to FTPS" step in the job. It worked properly for the last 2 months, but for the past few days it has been giving me a connection timeout error every few minutes. If I restart Spoon, or just try to connect using the test connection option, it starts working properly again.

How can I figure out the problem, or how can I resolve the issue?

As it stands, the job does not execute at the scheduled time and the files are not transferred to the server.

↧

Dimension with non hierarchic Levels/Members

April 11, 2014, 1:39 am


Hi folks,

Suppose you have a dimension like Car. Some properties of a car can be arranged hierarchically, e.g.:

DIM_CAR:

Hierarchy (default)
- Brand
- Series
- Type

But how should I model all the additional properties? Maybe I have something like

SteeringWheel (L,R)
NumWheels (3,4,6,8)
Color
etc.

I cannot place them in the same hierarchy as Brand, Series, Type because it makes no sense.

I can build additional hierarchies in my Car dimension:

DIM_CAR:

Hierarchy (CarType)
- Brand
- Series
- Type
Hierarchy (SteeringWheel)
- SteeringWheel
Hierarchy (Color)
- Color

But now I cannot use more than one hierarchy in a query. And maybe I want to do exactly that: use Brand as a filter, select Color and SteeringWheel on columns, and put my Sales fact on rows.

So what would be the best way to model something like that, i.e. dimensional properties which cannot be placed in a single hierarchy?

Thanks and Regards

↧

Auto pagination at Table Component

April 11, 2014, 4:06 am


Hi Forum,

Does anybody have an idea how I can implement automatic pagination (maybe every 30 seconds) in the Table Component?

Thank you for your help :)

↧

greedy step wise algorithm

April 11, 2014, 6:09 am


Hi,
I found that the GreedyStepwise algorithm creates a ranked list.
Can someone tell me what this ranking is based on?
Is it based on merit or on something else?
Thanks

↧

Read user's session variables from a query

April 11, 2014, 7:21 am


Hi,

Inside a query, I need to use a variable that an xaction with a custom component stores in the user's session. The custom component's class essentially does this:

@Override
protected boolean executeAction() throws Throwable {
    System.out.println("CrikhetComponent - executeAction");
    String username = null;

    // Read the optional "username" input of the action
    if (isDefinedInput("username")) {
        username = getInputStringValue("username");
    }

    // Call to the web service
    System.out.println("chiamata al ws, user:" + username);
    // Expose the result as the "userUrl" output
    setOutputValue("userUrl", "abc_" + username);
    return true;
}

So my question is: what is the correct syntax to retrieve the userUrl value inside the query?

↧

enlarge dashboard

April 11, 2014, 8:24 am


Hi,
I'm creating some dashboards using CDE, but at the moment I have a problem.
All my dashboards have some empty space on the left and on the right. Is there any way to remove it?

This is a screenshot of one of my dashboards.

Thanks
Attached image: dashboard.jpg (16.3 KB)

↧

Uncoherent values in maxheight of a group causes Assert Shift is not as expected

April 11, 2014, 8:43 am


Hi,

I have a report where one content row is really important and is also taller than the current page can display.
I was expecting to see the rest of my content on a second page.
Instead I get this error:

Code:

org.pentaho.reporting.engine.classic.core.InvalidReportStateException: Assert: Shift is not as expected: realY=117880900 != expectation=116980900; Shift=44409600; AdditionalShift=-900000; RealShift=43509600

    at org.pentaho.reporting.engine.classic.core.layout.process.PaginationStepLib.assertBlockPosition(PaginationStepLib.java:162)

    at org.pentaho.reporting.engine.classic.core.layout.process.PaginationStep.handleStartBlockLevelBox(PaginationStep.java:16

I searched without success. I tried the page-break options, etc., with no more success.
Some posts talk about fixed width....

What I observed is that the max-height of my details has this inherited value: 4.5035996E15
When I set a fixed value like 200 or 300, the line is truncated but displayed.

↧

The 3 pillars for Pentaho Community Strategy

April 11, 2014, 9:30 am


Almost one year on

It's been one year since Webdetails joined Pentaho. For me it seems much less than that! (even though if you ask the people that have to put up with me, I'm sure they'll say it feels like decades :p )

Throughout this time, we've been (re)defining a strategy for the Community. Yes, I actually have to do more work than flying out to different countries in the world to have beers with interesting people. And it's a great sign to see that there are many more events than the ones I can attend.

PCM13 group photo

Pentaho Community Strategy

The goals

Since the beginning, I have pretty much identified the goals for our community strategy. We want the community to be:
A place for developers

While our Enterprise offering is positioned as a production ready, enterprise scale solution, our open edition is aimed to gain the affection of the developer persona, one that's looking for an open source alternative that allows him to fulfill his goals.
An engagement mechanism

If we make it easier to bring those developers, consultants and general users on board, they become potential contributors willing to help us build a richer ecosystem around Pentaho. If people spend their time making the tools better, competing with and helping each other, they won't even look at any competitor. They will work to increase the quantity and quality of the Pentaho world.
A QA environment

The more users we have using our software, the better it will become. People will exercise the codebase under several conditions allowing us to collect more feedback that will translate into a better and more stable product for the ones that pay our salary: our customers.
An evaluation territory

Every user is an evaluator and a potential subscriber. So the open edition has to be a quality release, the best we can possibly achieve.
An upsell mechanism

With the huge amount of installs, our open edition has a great potential as a lead generator that we need to take more advantage of.

I'm a huge fan of being open and honest. Even though we are, in our core, an open source company, we want to make as much money as the next guy! And I'm a firm believer that through open source, not only we'll get there much faster, we're also doing "the right thing". Whatever that means - but at least it feels that way to me!

The strategy: The 3 pillars

We actually have a plan to get there. Our strategy is based on 3 well identified components:

Model
Facilitator
Distribution

Model

The Project Maturity Classification is the central piece on a strategy that aims to use the Community as a vehicle to reach our goals faster.

The two-lane approach aims to promote innovation, make clear what is and isn't supported and which levels of support are attached, and clarify to prospects, customers, the community, etc. how Pentaho leverages its Open Source roots and how that puts us at a huge advantage compared to the rest of the competition.
Facilitator

If done correctly, the community can be a huge source of innovation.

And in order to promote that innovation, it has to be simple to build extensions to our product. The large number of examples in PDI makes it simple for developers to develop new plugins, and Sparkl, or Pentaho Application Builder, aims to make it possible for consultants to develop those extensions on the BA server.
Distribution

The Marketplace is the main distribution vehicle, also acting as a major motivation for partners and other companies to create these extensions. The visibility they gain drives a lot of business opportunities.

It will also allow us to apply the early and often rule, giving users the ability to try early preview releases of our projects / plugins, acting as an extended QA team.

----------

Will it work? We'll see - If not, we'll try something else. Eventually, something will :)

-pedro


↧

insert select max values in al list

April 11, 2014, 9:55 am


Hi all,
I have a problem that I cannot solve.
I have a list of rows, and for every row I must add a new field holding the select max + 1 value of a table. What can I do?
If I try to get the select max value with an "Execute SQL script" step (executed for each row), the transformation fails with ORA-01000: maximum open cursors exceeded.

Please help me.
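
One way to avoid a query per row, sketched here in plain Java rather than as a PDI configuration: read the current maximum once and then hand out consecutive values in memory. This assumes nothing else inserts into the table while the rows are processed; the class and method names are hypothetical. In PDI, a comparable setup might combine a single Table input for the maximum, a Join Rows (cartesian product) step, an Add sequence step, and a Calculator step to add the two numbers.

```java
import java.util.ArrayList;
import java.util.List;

class MaxPlusOneSketch {

    // Given the maximum read once from the table, returns the next rowCount values.
    static List<Long> assignIds(long currentMax, int rowCount) {
        List<Long> ids = new ArrayList<>(rowCount);
        for (int i = 1; i <= rowCount; i++) {
            ids.add(currentMax + i); // max + 1, max + 2, ...
        }
        return ids;
    }

    public static void main(String[] args) {
        long currentMax = 41; // assumed result of a single "select max(...)" query
        System.out.println(assignIds(currentMax, 3));
    }
}
```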

↧