Quantcast
Channel: Pentaho Community Forums
Viewing all 16689 articles
Browse latest View live

How to open xlsx files ?

$
0
0
I m trying to input xlsx files using "Excel 2007 XLSX (Apache POI)" and "Excel 2007 XLSX (Apache POI Streaming)" but it doesnt work.

Can someone help me ?

Pentaho User Console Double Counting

$
0
0
Hello,

I have an analyzer report. I am also able to drill down on this report.

When I look at 2015 year data and 2016 year data, both have exactly the same data. When I click to drill down, both years are in the same drill down report. I have attached an image in case anyone wants to see.


1. Why is this happening?
2. How to fix this?
Attached Images

Possible combinations of strings from two Excel input columns

$
0
0
Hi,

I've only recently started using this program, and I'm still struggling. I have to create a model that outputs all the possible combinations from 2 Excel columns(strings) in another sheet. I have an Origin column with 30 factors, and a Destiny column, with 50 factors. So, there should be 1500 combinations total. I've tried using the Join Rows Step, but get an empty sheet as an output. I don't know if its because you can't use it with strings.

I'm really lost, and would really appreciate your help!!

Need help with ETL to re-insert data on the same table (auto-reference fields)

$
0
0
Hello,


Actually i have a MYSQL table ISSUES with the follow columns: id, parent_id, root_id, desc and project_id. parent_id and root_id references the same table (issues), id column.
The project_id for the new lines i get by joining with an auxiliar projects_from_to table.
The root_id for issues of one project is the id of the issue that parent_id is NULL for that project.
And the parent_id for the issues is the previous issue_id for one project (except the first issue that parent_id is NULL).

Initially it has the follow data:


(ID, parent_id, root_id, desc, project_id)
1, NULL, 1, test1, 100
2, 1, 1, test2, 100
3, 2, 1, test3, 100
4, NULL, 4, test4, 300
5, 4, 4, test5, 300


Using PDI i need to re-insert this 5 lines so i get the final result:


(ID, parent_id, root_id, desc, project_id)
1, NULL, 1, test1, 100
2, 1, 1, test2, 100
3, 2, 1, test3, 100
4, NULL, 4, test4, 300
5, 4, 4, test5, 300

6, NULL, 6, test1, 200
7, 6, 6, test2, 200
8, 7, 6, test3, 200
9, NULL, 9, test4, 600
10, 9, 9, test5, 600


If someone could give me a hint or show me how can i load the parent_id and root_id for the 5 new lines (that are based on the 5 first lines) i would appreciate.


One cenario is where i use table input and select the 5 lines setting parent_id and root_id as NULL, keep the desc value and insert on a table output (ID is auto increment).
And after that i load a issues_from_to table that would be like this:


(id_from, id_to)
1, 6
2, 7
3, 8
4, 9
5, 10

Maybe i could use the data of this auxiliar table to help me load the parent_id and root_id of the 5 new lines.

Can anyone help me?

Trouble with XMLs | Kettle 6.0.1.0

$
0
0
I am using the XML input step to read some XMLs. The XMLs are originally zipped and are unzipped by Pentaho in the previous step using Java Code.
Occasionally, an error is thrown which causes the complete job to halt.

org.dom4j.DocumentException: Error on line -1 of document : Premature end of file. Nested exception: Premature end of file.


Error on line -1 of document : Premature end of file. Nested exception: Premature end of file.org.dom4j.DocumentException: Error on line -1 of document : Premature end of file. Nested exception: Premature end of file.
Error on line -1 of document : Premature end of file. Nested exception: Premature end of file.

I have tried opening the file manually as soon as the error occurs, and it opens without any problem. I am even able to modify and save it.
Initially, I tried writing a custom Java class that would validate the XML file and then pass it on to the XML input step. However, the XML is always readable using the Java Code.

Could someone help out.
Thanks.

Store email in SENT Folder

$
0
0
When I send an email from Pentaho email component using the credential, I want to see the sent email in my SENT folder as well. Is there any way to achieve this feature from Pentaho email?

How to capture file name at run time and Store in variable and write to output file

$
0
0
Hi,
I am new to Pentaho Data Integration, Please let me know how to capture file name at run time and store in variable and write processed file name to output file.

How to use jsonPath in Modified Java Script

$
0
0
I'm trying to achieve something like this but I'm getting undefined error. I could see jsonpath-1.0.jar file is lib folder. Please suggest.
var o = '{ { "store": {
"book": [
{ "category": "reference",
"author": "Nigel Rees",
"title": "Sayings of the Century",
"price": 8.95
},
{ "category": "fiction",
"author": "Evelyn Waugh",
"title": "Sword of Honour",
"price": 12.99
},
{ "category": "fiction",
"author": "Herman Melville",
"title": "Moby Dick",
"isbn": "0-553-21311-3",
"price": 8.99
},
{ "category": "fiction",
"author": "J. R. R. Tolkien",
"title": "The Lord of the Rings",
"isbn": "0-395-19395-8",
"price": 22.99
}
],
"bicycle": {
"color": "red",
"price": 19.95
}
}
} }', // the 'store'
res1 = jsonPath(o, "$..author").toJSONString(),
res2 = jsonPath(o, "$..author", {resultType:"PATH"}).toJSONString();

Problem publishing a cube from Workbench

$
0
0
Hi all, this is my forst post.
I´m new in Pentaho. I creted a schema and a cube in workbench and try publish it to User Console but I'm getting this error:

com.sun.jersey.api.client.ClientHandlerException: java.net.SocketException: Unexpected end of file from server
at com.sun.jersey.client.urlconnection.URLConnectionClientHandler.handle(URLConnectionClientHandler.java:151)
at com.sun.jersey.api.client.filter.HTTPBasicAuthFilter.handle(HTTPBasicAuthFilter.java:104)
at com.sun.jersey.api.client.Client.handle(Client.java:648)
at com.sun.jersey.api.client.WebResource.handle(WebResource.java:680)
at com.sun.jersey.api.client.WebResource.access$200(WebResource.java:74)
at com.sun.jersey.api.client.WebResource$Builder.put(WebResource.java:537)
at org.pentaho.mondrian.publish.PublishToServerCommand.publish(PublishToServerCommand.java:319)
at org.pentaho.mondrian.publish.PublishToServerCommand.execute(PublishToServerCommand.java:220)
at org.pentaho.mondrian.publish.workbench.MondrianPublish$1.actionPerformed(MondrianPublish.java:87)
at javax.swing.AbstractButton.fireActionPerformed(Unknown Source)
at javax.swing.AbstractButton$Handler.actionPerformed(Unknown Source)
at javax.swing.DefaultButtonModel.fireActionPerformed(Unknown Source)
at javax.swing.DefaultButtonModel.setPressed(Unknown Source)
at javax.swing.AbstractButton.doClick(Unknown Source)
at javax.swing.plaf.basic.BasicMenuItemUI.doClick(Unknown Source)
at javax.swing.plaf.basic.BasicMenuItemUI$Handler.mouseReleased(Unknown Source)
at java.awt.Component.processMouseEvent(Unknown Source)
at javax.swing.JComponent.processMouseEvent(Unknown Source)
at java.awt.Component.processEvent(Unknown Source)
at java.awt.Container.processEvent(Unknown Source)
at java.awt.Component.dispatchEventImpl(Unknown Source)
at java.awt.Container.dispatchEventImpl(Unknown Source)
at java.awt.Component.dispatchEvent(Unknown Source)
at java.awt.LightweightDispatcher.retargetMouseEvent(Unknown Source)
at java.awt.LightweightDispatcher.processMouseEvent(Unknown Source)
at java.awt.LightweightDispatcher.dispatchEvent(Unknown Source)
at java.awt.Container.dispatchEventImpl(Unknown Source)
at java.awt.Window.dispatchEventImpl(Unknown Source)
at java.awt.Component.dispatchEvent(Unknown Source)
at java.awt.EventQueue.dispatchEventImpl(Unknown Source)
at java.awt.EventQueue.access$500(Unknown Source)
at java.awt.EventQueue$3.run(Unknown Source)
at java.awt.EventQueue$3.run(Unknown Source)
at java.security.AccessController.doPrivileged(Native Method)
at java.security.ProtectionDomain$1.doIntersectionPrivilege(Unknown Source)
at java.security.ProtectionDomain$1.doIntersectionPrivilege(Unknown Source)
at java.awt.EventQueue$4.run(Unknown Source)
at java.awt.EventQueue$4.run(Unknown Source)
at java.security.AccessController.doPrivileged(Native Method)
at java.security.ProtectionDomain$1.doIntersectionPrivilege(Unknown Source)
at java.awt.EventQueue.dispatchEvent(Unknown Source)
at java.awt.EventDispatchThread.pumpOneEventForFilters(Unknown Source)
at java.awt.EventDispatchThread.pumpEventsForFilter(Unknown Source)
at java.awt.EventDispatchThread.pumpEventsForHierarchy(Unknown Source)
at java.awt.EventDispatchThread.pumpEvents(Unknown Source)
at java.awt.EventDispatchThread.pumpEvents(Unknown Source)
at java.awt.EventDispatchThread.run(Unknown Source)
Caused by: java.net.SocketException: Unexpected end of file from server
at sun.net.www.http.HttpClient.parseHTTPHeader(Unknown Source)
at sun.net.www.http.HttpClient.parseHTTP(Unknown Source)
at sun.net.www.http.HttpClient.parseHTTPHeader(Unknown Source)
at sun.net.www.http.HttpClient.parseHTTP(Unknown Source)
at sun.net.http://www.protocol.http.HttpURLConn...tream0(Unknown Source)
at sun.net.http://www.protocol.http.HttpURLConn...Stream(Unknown Source)
at java.net.HttpURLConnection.getResponseCode(Unknown Source)
at com.sun.jersey.client.urlconnection.URLConnectionClientHandler._invoke(URLConnectionClientHandler.java:249)
at com.sun.jersey.client.urlconnection.URLConnectionClientHandler.handle
(URLConnectionClientHandler.java:149)
... 46 more

Another error I found on …\Pentaho\server\biserver-ee\logs\pentaho.log is: “ERROR [AnalysisDatasourceService] Error putMondrianSchema MondrianCatalogHelper.ERROR_0004”, I don´t know if it is related.

Best regards,

Ivan

Pentaho Data Integration in Java Application without using Maven

$
0
0
Is it possible to embed Pentaho DI into my java application without using maven? (e.g. through import of jars in the library) I'm using Netbeans as my IDE and I've already created my application without the use of maven so I'd like to know whether it is still possible to integrate Pentaho DI, since most of the solutions I've seen uses dependencies? If yes, how can I do it? I just want to simply execute a job file (made via Spoon) on button click.

Additional info: in my Job file the transformations in it access tables from mySQL database. The file works perfectly fine and does what's intended to do. I just want to be able to execute the job in my java application.

PCM16 - Pentaho Community Meeting, coming up Nov 11-13

$
0
0
2 weeks to go. For the 9th year in a row, this fantastic event will take place in less than 2 weeks in Antwerp. See you soon!!



Now, let me shamelessly copy the blog post from Bart Maertens, the organizer of the event:



Announcing #PCM16, Antwerp, Belgium!!

Register now!
Friday, Nov 11th: PCM16 Hackathon
Saturday, Nov 12: PCM16

Submit your talk proposal!

Use case room: pcm16_biz@know.bi
Tech room: pcm16_tech@know.bi
AGENDA: What's cooking a PCM16? Have a look:
ANTWERP: How to get there and things to do in
After the 2015 edition in London, the eighth yearly Pentaho Community Meeting will be back where it was in 2014: Antwerp, Belgium. The dates for PCM16 will be Friday, November 11th and Saturday, November 12th. The venue for this edition will be the gorgeous medieval hospital and monastery Elzenveld. The views may not be as spectacular as they were in the 2013 Sintra edition, but the location sure will be fine!






As was the case in the 2014 and 2015 editions, we’ll have a hackathon (followed by drinks) on Friday evening and two presentation rooms (business and technical) on Saturday.
As has been the trend in the latest PCM editions, we aim to make this the European Pentaho event of the year for both Enterprise and Community Edition users. After all, no matter which version you use, we’re all just a community of Pentaho users.
The event is free of charge because of sponsorships by know.bi and Pentaho, there will only be a small charge (€10) for lunch which you will kindly be asked to pay in cash when registering on Saturday.
In return for a weekend of your time, you’ll enjoy a couple of days of being submerged in everything Pentaho, (Big) Data, Data Science and the excitement of talking to and working with the community involved in all of this.
Registrations are open now, register on our eventbrite pages for the hackathon and PCM:
Friday, Nov 11th: PCM16 Hackathon
Saturday, Nov 12: PCM16
Friday, November 11th

On the evening of Friday, November 11th, we’ll be hosting a hackathon. People will have to travel to Antwerp, therefore we won’t be able to start early (8PM-ish) and have a hackathon for hours on end. However, as has been shown in the previous years, a couple of hours suffice to build and present impressive solutions with PDI, Mondrian or CTools.
As tradition has it, beer is an important part of a pre-PCM Friday evening, and there’s nowhere better to go for beers than Belgium! There are quite a number of pubs in the vicinity of the venue: ‘K. Zeppos’, named after the -at least in Belgium- world famous sixties TV series ‘Kapitein Zeppos’ and ‘Pallieter’, named after a 1916 novel by Flemish writer Felix Timmermans, just to name a few.
After a couple of hours of hacking, this is the perfect excuse to enjoy some of our famous Belgian beers. Take it easy though, these are not Amstel or Heineken!
Saturday, November 12th

The rooms

Traditionally, Saturday is what a true PCM is all about!
Just like in the last editions, there will be two rooms: business and technical.
The business room will be your goto place for use case presentations where Pentaho customers and/or users explain what real life problems they are solving with the Pentaho suite.
Some of Pentaho’s biggest and most prestigious implementations will be presented here, alongside smaller but not less interesting implementations.
The technical room is for more technical presentations. This is the ‘old school’ Community Meeting room, and your goto place to find out what’s cooking within Pentaho and the Pentaho Community. Powerpoint used to be forbidden, just like beaming code on the big screen was mandatory in the earliest Pentaho Community Meetings, find out for yourself if this still stands.
Call for Speakers

A list of speakers for both rooms is currently being compiled.
If you’d like to present, please mail to pcm16_biz@know.bi or pcm16_tech@know.bi with a short description of your presentation proposal and we’ll get back to you asap.
Sunday, November 13th

Again, not intending to change a winning team and sticking to tradition, we’ll have a social activity on the post-PCM Sunday.
An agenda is still being compiled, but we’ll post regular updates here, so stay tuned for more.
How to get there

By Plane
When you're arriving in Belgium through Brussels Airport, there are direct trains from the airport to Antwerp (approximately 30 minutes).
A (limited) number of cities have direct connections to Antwerp Airport, which is just a 15 minute taxi ride from the city center.
By Train
Antwerp has connections to several European cities, including a number of high speed connections. Check out the Belgian Rail website for more details.
By car
Follow your GPS to you hotel's address. Driving in Antwerp is fine, but traffic around the city can be challenging ( slow).
Agenda

Technical Room


Use Cases






More...

Pentaho 6.1 Permission for Users based on LDAP / Active Directory Groups

$
0
0
Hi all,

I have Pentaho 6.1 connected to the Active Directory.
I am able to set permissions for users, e.g.
File 1 - accessible for user1
File 2 - accessible for user2

Now I want to assign permissions based on the groups the users belong to in AD. e.g.:
Group1: user1, user3
Group2: user2, Group3
Group3: user4
File 1 - accessible for all users, who belong to Group1

In my understanding I can to that by sharing the files with the roles I get from my AD. But in fact that doesn't work.
Can anyone please give me a hint?

Thanks a lot!

Here is the content of my applicationContext-security-ldap.properties


contextSource.providerUrl=[url]
contextSource.userDn=[user]
contextSource.password=[pw]

userSearch.searchBase=OU=[...]
userSearch.searchFilter=(\&(sAMAccountName=\{0\})(memberOf:1.2.840.113556.1.4.1941:=CN=app_pentaho))

populator.convertToUpperCase=false
populator.groupRoleAttribute=cn
populator.groupSearchBase=OU=[...]
populator.groupSearchFilter=(member:1.2.840.113556.1.4.1941:=\{0\})
populator.rolePrefix=
populator.searchSubtree=true

allAuthoritiesSearch.roleAttribute=cn
allAuthoritiesSearch.searchBase=OU=[...]
allAuthoritiesSearch.searchFilter=(\&(objectClass=group)(memberOf:1.2.840.113556.1.4.1941:=CN=app_pentaho))

allUsernamesSearch.usernameAttribute=sAMAccountName
allUsernamesSearch.searchBase=OU=[...]
allUsernamesSearch.searchFilter=(samAccountType=805306368)

adminRole=CN=app_pentaho_admins
adminUser=CN=[myadmin]

Best way to update XML attributes?

$
0
0
Hello,

I want to replace an attribute value with another in the XML file. I tried using RegEx in sed but it didn't work properly. I couldn't find anything promising in Pentaho which can do the replacement of the attribute values. Does anybody has a plugin or a url to that plugin that can Update the XML?

Processing in parallel with thread limit

$
0
0
I have many tables to load to the staging area. I want to process them in parallel but in the order do not overload system I want to limit how many processes to run in the same time based on the capacity of current configuration. if there is something like in SSIS when you design all such processes to run in parallel but just limit them and job select next one in line when one of previous is finished.

Splitting text file at specific strings

$
0
0
Hello

I want split a text file at specific strings. The data file is structured with several sub-headings such as:
%T TASK
%T POBS
%T UDFTYPE

with data in between these headings. The data is tab delimited with a different number of fields within each section.
For example:

%T UDFTYPE
%F udf_type_id table_name udf_type_name udf_type_label logical_data_type super_flag indicator_expression summary_indicator_expression
%R 128 TASK user_field_128 CL Start Chainage FT_INT N
%R 129 TASK user_field_129 CL Finish Chainage FT_INT N
%R 130 TASK user_field_130 CL Bar Style FT_INT N
%R 131 TASK user_field_131 CL Color FT_INT N
%R 135 TASK user_field_135 CL Bar Text FT_TEXT N
%R 136 TASK user_field_136 CL Location Offset FT_TEXT N
%T POBS

So the first step is to split the file into these sections and ideally include the %T value in the successive rows until the next sub-heading

If anyone can help, I would be very grateful

thank you


Andrew

Problems accessing user console

$
0
0
Hi,

User Console does strange things. Sometimes, you accessed with a user X and has access permissions of other Y. At other times, login with Admin, in the header appears AnonymousUser. Even it is given the case of access as a business user role and have administrator access. This is a serious problem o security.
I think it may be a problem session cache or similar. If the browser is closed, it is usually solved.
What can happen?
Regards.

How to create a new kettle database repository ...Not able to figure out id password

$
0
0
How to create kettle repository..
Don't now id password
while connecting it is giving unauthorized access error.

Reading SPSS(.sav) file from pentaho Kettle

$
0
0
Hello there,
You might already heard the same question "How to read SPSS(.sav) file with pentaho kettle?", But i did'nt find any answers, all the questions are dated back to 2011. So if there is any development on this matter please reply.

Download CSV URL

$
0
0
Hi,

I use the "Transfer a file using HTTP..." control to download a couple of CSV files and write them into a database in a later step. It works perfectly fine when the URL opens the CSV file immediately, i.e. if you put the URL I use into a browser, the CSV values would be displayed in it. The control itself is configured to put the CSV content into another CSV (Target file).

Now, I have a different set of URLs which wouldn't display the CSVs in the browser if put into the address bar, but start to download them. I don't know how to handle these. I thought I might be able to use the same control and configure the header tab by entering a MIME type, but it didn't work out. Any suggestions?

executing batch files

$
0
0
Can we create exe or batch files to execute transformations and jobs.

My doubt is inorder to do this, what is the pre requesite software required. Also tell me if the code is one system and the batch files are in another system, is it possible to execute now? Please let me know as soon as possible.
Viewing all 16689 articles
Browse latest View live