Quantcast
Channel: Pentaho Community Forums
Viewing all 16689 articles
Browse latest View live

Shell script as input source

$
0
0
How can I stream the output of a shell script as the input source to a PDI transformation? Basically to use the output of a shell script instead of, say, a database query.

I know I could execute the shell script in a job, saving the output to a flat file in one step, and then use the flat file as the input to a transformation in a separate step. But that's so 90s! Is there any way to achieve what I attempting? Maybe some custom input plugin??

Thanks,

Geoff

Error logging

$
0
0
Hi All,

Is there any way to log the error lines to a column in a table.
I saw in transformation properties, there is a way to log the entire logs into a column.
But i need only error lines to log into the table.

Could anyone help please???


Thank you in advance!!

Unable to connect in JNDI while creating new Data source in bi server

$
0
0
I am getting the below error
ConnectionServiceImpl.Error_0009 Connection to database[BI] failed
I am using Pentaho BI server community edition 5.2.0 version and i am trying to use JNDI connection where i am getting the error.
For JDBC connection it worked fine
I have defined the JNDI credentials inside the
"biserver-ce-5.2.0.0-209\biserver-ce\pentaho-solutions\system\simple-jndi\jdbc.properties"
Please help me on this

Ho to inprove the provemance?

$
0
0
Hi,

I used the Get File Name + CSV File Input to read the data , but it spent more time.

As 15788 files returned by Get file name.


when i change the copies to 10 in the CSV input, the reading time is not changed very much.


can anyone give me the correct solution?

What is the current version and issue tracker?

$
0
0
I have an application using Mondrian 3.9. At the same time I know that 4.x has been around for a while, so I wanted to give it a shot. I got really confused though:

Downloads at http://community.pentaho.com/projects/mondrian/ list Mondrian 3.10.0 Stable as the latest official release.

In Maven repo (http://repository.pentaho.org/conten...taho/mondrian/) the latest build (not counting 4.4-SNAPSHOT) is 4.3.0.1-130. This is the one I'm trying to run.

I found a few bugs in this one. Some I can work around (https://github.com/pentaho/mondrian/issues/439). Others are a showstopper (https://github.com/pentaho/mondrian/issues/454).

I see that issues on GitHub don't really get much attention, so I wanted to try reporting in JIRA. To my surprise, its list of "Affected Versions" is even more puzzling. It has a large number of 5.x versions, up to 5.4.0.1 GA. There is a number of 4.8.x versions. There are also things like "4.0.0 (5.1.0 GA Suite Release)" and "3.5.x (4.8.2 GA Suite Release)". Nothing between 4.1 and 4.7 (inclusive).


All in all, this is extremely confusing and frustrating, if you can't even figure out what the current version of the software is!

Now, my questions:

1. What is the latest version of Mondrian CE that is considered stable, usable and production-ready?
2. What is the status of the 4.x and 5.x versions? What are they?
3. Which issue tracker should be used for reporting issues in Community Edition?

Regex in Replace in string step

$
0
0
Hi All,

I tried to replace a string using regular expression.

I want to replace the string abcd20150607.txt to abcd.txt
I tried with SELECT REGEXP_REPLACE('abcd20150607.txt', '[[:digit:]]') FROM dual; an got output.
When i replaced [[:digit:]] in Replace in string-search string, i got wrong output , it is not replacing the digits

Can anybody help?

PDI Logging : In Search of relational schema and/or practical queries

$
0
0
Community,

I've activated logging from all angles of my job(s) and transformation(s). We are not using a repository, so I am left with
a handful of tables I am trying to join together to make sense of things.

I have two primary challenges :

1. I am struggling to reverse engineer a single SQL statement joining up multiple tables properly....based on channel_id
2. Contents of the various tables show up at different times...making it difficult for me to understand which tables
can be used intra-process and which are best left for inter-process (after the job completes/exits).

If I had a schematic of the PDI logging schema/tables, with keys I could solve (1). If I could functional queries used by others
I could tackle (2).

Happy to share what I come up with - if it hasn't already been shared - if someone would like to pitch in.

Thanks,

Michael

install without hitting enter ? (pentaho installation automation)

$
0
0
Hi I am working on automating installation of Pentaho PDI, I would like to know if there is any way that I could install Pentaho PDI without hitting enter and typing Yes in the command line ?

Mysql DB with PDI CE 5.3 java 1.8 unable to load 3900000 records

$
0
0
Hi,
mysql database with PDI CE 5.3 java 1.8 with jvm size(6gb)

I am trying to load data 3900000 records with in the same database using different scenarios because of to avoid java heap space issue. for that i have used Java script logic to iterate the job for every limit range(1 , 100000), attached java script (wh.txt) as well, could you please suggest best approach.

scenario 1: Table Input -> Insert/Update
in this case job is iterating but flailing at 3400000 records (java heap space issue)

scenario 2: Table input -> Table Output
in this case , not able to iterate the job every time it is taking limit range(1 , 100000), seems to be java script logic will not iterate at Table output step.

scenario 3: Table Input ->(few logic) -> synchronization after merge
in this case also job is iterating but flailing at 3400000 records (java heap space issue)
wh.txt
insert update.png
Sync merge.jpg
Attached Images
Attached Files

Limit range iteration in Mysql database with PDI CE 5.3

$
0
0
Hi,

I am trying to iterate the job with limit range 0, 1000000 using java script in PDI CE 5.3 , instead of java script afe we have another approach to iterate the job with limit range 0, 1000000.

I have verified calculater samples .trn but not able to get my logic, could you please help me.

PDI CE 5.3
MYSQL
JAVA 1.8

java script logic is here wh.txt
Thank you
Attached Files

Pentaho PRD 5.3 Crosstab Row and Column Summary with Grand Total

$
0
0
Product Jan Feb Product Total
A 1000 2000 3000
B 3000 4000 7000
Monthly Total 4000 6000 GrandTotal: 20,000








Hello All!

I am trying to get a table summary for a few products sales over 12 months. The expected output demonstrated as above.
My data model are in row format, SQL GROUP by Product, Month, The table was constructed by crosstab in PRD 5.3, with Summary checkboxes (Row and Column) checked.

I can see the row summary (sum across) is working, however the Monthly Total for Jan Feb etc.. are not showing up. Also How do I do a Grand Total (sum of the row and col sums ? )


Thanks for any information or work-around

Ran

ERROR while trying to use jndi connection

$
0
0
Still i am getting error
I am getting the below error
ConnectionServiceImpl.Error_0009 Connection to database[BI] failed
I am using Pentaho BI server community edition 5.2.0 version and i am trying to use JNDI connection where i am getting the error.
For JDBC connection it worked fine
I have defined the JNDI credentials inside the
"biserver-ce-5.2.0.0-209\biserver-ce\pentaho-solutions\system\simple-jndi\jdbc.properties"
Please help me on this


I have followed the following steps


  • Stop the Tomcat and BA servers.
  • Edit the /tomcat/webapps/pentaho/WEB-INF/web.xml file.
  • At the end of the <web-app> element, in the same part of the file where you see <!-- insert additional resource-refs -->, add this XML snippet.<resource-ref>
    <description>BI</description>
    <res-ref-name>jdbc/BI</res-ref-name>
    <res-type>javax.sql.DataSource</res-type>
    <res-auth>Container</res-auth>
    </resource-ref>Change the description and res-ref-name nodes, as well as any others that apply to your situation and fit your database. You may need to consulthttp://tomcat.apache.org/tomcat-6.0-...les-howto.htmlto see if there are other things to consider.
  • Save and close the web.xml file.
  • Edit the /tomcat/conf/context.xml with a text editor. Alternatively, you can modify the/tomcat/webapps/pentaho/META-INF/context.xml file if you want this data connection to be available only to the BA Server. Adding JNDI connections to the context.xml makes them available to all of the webapps deployed to this Tomcat instance.
  • Anywhere inside of the <Context> element, add this XML snippet. Then save and close the context.xml file.<Resource name="jdbc/BI"
    auth="Container" type="javax.sql.DataSource"
    factory="org.apache.commons.dbcp.BasicDataSourceFactory"
    maxActive="20"
    maxIdle="5"
    maxWait="10000"
    username="dbuser"
    password="password"
    driverClassName="org.postgresql.Driver"
    url="jdbc:postgresql://localhost:5432/wisdom2412"
    /> This example shows a simple PostgreSQL configuration. Replace the Resource name, username, password,driverClassName, and url parameters, or any relevant connection settings, to match your database connection information and the details you supplied in the web.xml file earlier.
  • Delete the pentaho.xml filed located in the /tomcat/conf/catalina/directory. The pentaho.xml is a cached copy of the context.xml file you modified. Since the cache is not usually configured to update frequently, you have to delete the pentaho.xml file and let Tomcat recreate it when it starts up.
  • Start the Tomcat and BA Server.




PLEASE HELP ME ON THIS>>>>>>>>>>>>>>>>>>>>>

Why is multi-threading a problem in CDE ?

$
0
0
Hi all,

I was just wondering why the multi-threading in CDE is a problem... I mean, in the majority of my work, the update is done before a parameter changed, hence the queries i want to execute are false... A solution i have is to use some javascript (but that doesn't work in all cases) or use alerts (but it's a bit annoying...). In one word, often (and even sometimes randomly), it needs one more refresh to be the thing i want to have. Is there some way to actually sync all the things i'm doing ? Or maybe i'm doing it wrong ?

Thank you for your answers :)

Reading an excel file having multiple languages

$
0
0
I am trying to read an excel file in which there is "Text" column which contains content in Korean / Thai / German / Chinese.
When i try to read the content from this file using "Text file input", it is unable to identify the data format. I believe the issue is with the encoding. Could any one please guide me as to how to handle this situation.

Scatter Plot Chart

$
0
0
Hello,

I'm trying to create a Scatter Plot using Pentaho report designer 5.4.
My table contains 2 columns: duration(BigDecimal) and speed(BigDecimal).

I'm setting x-value-columns=speed and y-value-column=duration, but the graph doesn't have any points.
I verified that the query is correct, does anyone have an idea why I'm getting an empty graph?

Thanks for your help!

Weka Time series analysis - Inventory management

$
0
0
Hi!

I am an economics student planning to write a paper on using weka for inventory management forecasts in retail (grocery shop). The key objective is to see how to use weka to plan future needs. Can you help me with some pages to start with and do you think this will be possible with simulated data (I can't get acces to real data)?

Fixing name in Pentaho Data Integration

$
0
0
i've assigned by my boss in my company to used Pentaho Data Integration/Kettle
how to fixing string data type (example persons name) in which included a special character, into a fixed string data type without special character.
example :


Name Job
4ndy Pr0grammer
Benn! Fin@nce
to

Name Job
Andy Programmer
Benni Finance


thank you :D

Job scheduled doesn't work, bi-server v3.7

$
0
0
Hi,

I have a problem with jobs scheduled in admin console. I had 3 jobs scheduled which worked fine but they were deleted unintentionally.

When I want to add them in Admin console, it doesn't work. When I check the job and click "run", the "last fire time" column is update but nothing happen. I have nothing in log files.

xactions files run from User console work but from admin console don't work.

If you can help me... :)

Thanks a lot.

Dimension Order (Pentaho 4.8)

$
0
0
Hi,

I'm new to Pentaho (and this forum, so apologies if i've asked this question in the wrong location) - and am trying to define the order of dimensions (not the data) within a hierarchy but not having much luck.

I have a hierarchy called Requested Delivery Date.Flat and within that i have 7 dimensions which are (as far as i can tell) being sorted alphabetically. I want to change the order so that they appear in the user console as per the below - i've tried a simple approach, by just renaming the dimensions with both a numeric and alphabetic prefix but this didn't change the order they are in...
Any suggestions on how to go about this?

• Requested Delivery Year
• Requested Delivery Season
• Requested Delivery Quarter
• Requested Delivery Month
• Requested Delivery Week
• Requested Delivery Day
• Requested Delivery Date


Regards,
Gavin

Insert/Update and Synchronize after merge

$
0
0
Hi,

Could you please suggest what is difference mechanism between Insert/Update and Synchronize after merge. I hope both steps are using for insert the values if they are new, update the values if they are existing records in target database. But what is the exact difference?

I tried in google and PDI wiki , They provided the info separately for each step but not the differences, please...
Viewing all 16689 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>