Quantcast
Channel: Pentaho Community Forums
Viewing all 16689 articles
Browse latest View live

Return Site Information via PDI

$
0
0
Hi everyone! :)

I would like to know if Pentaho DI can extract information from any site. I would like to access a website and return a title from a search done on it.

For example: Log in here on the forum and filter all topics titles that contain .txt via Pentaho Data Integration.


Thanks! ;)

Issue with setting up error handling

$
0
0
I fed bad data into csv input step to make it fail and then to capture error in the error logging step. Attached is my ktr and test file.
When parsing fails, the bad data doesn;t flow to the error handling path.

Any help is appreciated. Thanks.

Quote:

Originally Posted by pdi View Post
Thank you very much gutlez. I followed your instructions and the transformation was successful. Here is a screen capture of my steps. The TFI - error handling step shows the rows that were causing the transformation to fail.
However, the Text file output - Error handling 2 step does not contain any error logs. Yet, without it, the transformation fails. I am confused by this phenomenon. If possible, could you please shed some light on this for me.
Again I appreciate your guidance.

Attached Images
 
Attached Files

How to configure logs for carte server?

$
0
0
Hello,
I have setup a carte slave server. i start it using carte.bat "carte-slave-server-config.xml" >> d:\logs\Pentaho.log
Is there any other way to setup the logging (job/transformation related logging info) to automatically rotate every day? I would also like to configure the level of logging (ie Baisc, Detailed etc) If so, could you please let me know how to achieve this.

Env: Pentaho Community Edition 7.1 on Windows.

@johanhammink @marabu -- Any suggestions?

Thanks
Ravi

Upload and Download permission to differnt Users in Pentaho CE 6.0

$
0
0
I want to give upload and download rights to other user in Public folder of Pentaho User console. Right now only admin can publish the reports.The upload option comes in home directory where the folder of user is created. I Want same thing but in public folder.Is there any way where we can give upload option to the users defined.

Merge join step messes up with left outer join?

$
0
0
I use PDI 7 on SQ ver 2016. When I run a merge join with a left outer join where the sorted list 'right' starts earlier in the sort than 'left', it totally messes up and fails to make any successful joins. Who else has this issue and how do you solve it?

REST Client - session management

$
0
0
Hi,

I have to get data from weird API where I need to use http GET then http POST and again http POST (each time server is returning new key).

I noticed that session is important for the server so I have execute those steps in one session.
Question; If I use REST Client(GET) then REST Client(POST) then REST Client(POST) ...all is done in three transformations...then I will have three different sessions, right?

Is there any trick to have same session?

And one more question; I made a working test in Postman where I have the parameters I send to server as body/raw. What is the equivelent of "raw" in REST Client?

Weird Output (text file) Result

$
0
0
Hey everybody!

I'm using Text Output (extension .csv) after all my transformation... But for some reason the beginning shows some XML structure that wasn't suppose to be there. When I click the tab "Preview Data" under the Execution Results window everything looks fine, but when I open the file it's just all messed up. I'm attaching a few images and the file itself.

Any help is welcome....

File Print
Name:  Screen Shot 2018-02-28 at 11.52.35 PM.jpg
Views: 47
Size:  26.5 KB

Output I
Name:  Screen Shot 2018-02-28 at 11.54.05 PM.jpg
Views: 39
Size:  22.1 KB

Output II
Name:  Screen Shot 2018-02-28 at 11.55.22 PM.jpg
Views: 40
Size:  18.9 KB
Attached Images
   
Attached Files

Date Picker in Report Query

$
0
0
Hi All.

I am using following query to fetch data from Postgres

SELECT * FROM Func1('archivevalueTable',ARRAY[${TagName}],'2017-09-06 10:00:00','2017-09-06 18:00:00')
where timetest1>=${StartTime} and timetest1<=${EndTime}

For Tags I am able to pass required info.

But For time i.e. providing StartTime and EndTime I am not getting any data

My parameter details for StartTime and EndTime as follows

Name: - StartTime(EndTime)
Value Type:- Timestamp
Data Format:- yyyy-MM-dd hh:mm:sss
DisplayTime :- Datapicker
Query:-datanew
value:-timetest1

Please find attached
image
Code:

Name:  ERR1.png
Views: 31
Size:  3.5 KB
of time on report.
where my following hard coded query works fine for me.

SELECT * FROM Func1('archivevalueTable',ARRAY['Para1','Para2','Para3'],
'2017-09-06 04:00:00','2017-09-09 21:00:00' ) where timetest1>='2017-09-06 14:00:00' and timetest1<='2017-09-09 21:00:00'


Query Require time to be sent in format :- '2017-09-06 14:00:00'

and I get the data as..

Para1 5.99 06-09-2017 14:00:00
Para2 5.6 06-09-2017 14:00:00
Para3 7.6 06-09-2017 14:00:00
Para1 6.3 06-09-2017 14:15:15
Para2 6.2 06-09-2017 14:15:15
Paar3 8.22 06-09-2017 14:15:15




Please Support

Regards
Attached Images
 

xpath select does not appear to exclude elements

$
0
0
Hello,

I am having trouble with getting Xpath to work in Pentaho 4 running on windows 7.
My goal is to to Select Each F node but exclude the child F nodes. What I am trying
to get to work in Pentaho works on the below xpath tester website.

I want to be able to select the first F node and all elements except for child F nodes.
In my real XML I have child nodes that have the same element name as parent nodes.
I expect to get back f1 and f1a for the top level F. For the second F I expect to get back f2
and the final F to get back f3.

My structure looks like
Code:

Top 
    Next
    F
      f1
      f1a
          F
            f2
            F
              f3


I have tried a LOT of different variations of xpath 1.0 syntax but what I want does not work.
I can only get it to pull the first element below F. In other words it will return the 'f1' element
but nothing else. However when I use the website below with the sample xml
below the website works correctly. But when I add the same thing in Pentaho I just get back 'f1'.
I expect to get back f1, f1a, and gotit for the top level F.

Pentaho step looks like this with the xml below as input
Loop Xpath: /Top
Fields: Name->MyElement, Xpath-> //Next/F/*[not(descendant::F)]
Element: Node
Result type: Single Node

Thank you for any guidance,
Oring

-------
-------
http://chris.photobooks.com/xml/default.htm (or use the xpath tester of your choice)

//Top/Next/F/*[not(descendant::F)]


Code:

<Top>
    <Next>
      <F>
          <f1>f1</f1>
          <f1a>f1a</f1a>
          <F>
              <f2>f2</f2>
              <F>
                  <f3>f3</f3>
              </F>
          </F>
      <gotit>gotit</gotit>
    </F>
    </Next>
</Top>

Mongo DB Connection error.

$
0
0
Hello All,
I am getting following error while connecting to Mongo DB on remote PC.

"Unexpected names retrieving database names from Mongo DB.Check your connection details."

My setting for Mongo DB are:
Host:- 192.168.1.102
Port:-27017
AuthDatabase : -histdb (database name of mongo db)
user:- user1
password:-password

When I click on GetDB I get the error mentioned above.


Note: - Previously I was not using any userID/Pwd for accesing my db .i.e. my db was not aunthticated .In that case I was able to connect to Mongo DB.But now I have put userID/pwd and I am not able to connect to DB.

Kindly Support

Mongo DB Connection Issue

$
0
0
Hello All,
I am getting following error while connecting to Mongo DB on remote PC.

"Unexpected names retrieving database names from Mongo DB.Check your connection details."

My setting for Mongo DB are:
Host:- 192.168.1.102
Port:-27017
AuthDatabase : -histdb (database name of mongo db)
user:- user1
password:-password

When I click on GetDB I get the error mentioned above.


Note: - Previously I was not using any userID/Pwd for accesing my db .i.e. my db was not aunthticated .In that case I was able to connect to Mongo DB.But now I have put userID/pwd and I am not able to connect to DB.

Kindly Support

Job Mail Attachments selection

$
0
0
Hi everybody,
I've spent some time lurking around information regarding how to attach files in the "Mail" step of a job.
What I've ended up finding is that flagging "Attach file(s) to message", and selecting "General" as file type, all the files processed in the Job are attached to the mail.

My situation is the following:
I process a total of three files in the job's transformations, two of them need to be attached to the mail at the end of the job, but one not.
That file is imported in the DB, and later on, moved to a "processed" folder. When the Mail step is performed, the file can't be found and therefore the Job fails.

I don't want to add that file as attachment to the mail, but so far I didn't find the way to select specific files as attachments.

I hope I've been clear enough, could anyone help me please?

Many thanks in advance!

Database Connections

$
0
0
Hello guys!

The problem:

I have a job with 30 transformations each with a connection (all the same connection).
Now i have to change the password of the connection user.
I would not like to have to change all the passwords in each transformation, however, i think it will be necessary in this case.

The doubt:
Is there any way to avoid this in the future?

Thanks!

Create a Table

$
0
0
Quote:

Originally Posted by gutlez View Post
Seems a terrible workflow design plan.
But yes, you can read the first line of the CSV to get the headers (column names), and feed that to a Meta-Data Injector that creates the tables, and then another to load the CSV into that table.

Hello sir, if you happen to have .ktr file with an example, could you please post it here? Thank you.
Right now I'm trying to do it with dynamic table creation example, but this job cannot find destination table, which supposed to be created at first transformation "create table". But I got the error on the "populate table" transformation, saying that "POPULATE CUSTOMERS" table doesn't exist. I'm not posting this transformation, as it is part of samples/transformations/dynamic-table examples

Appreciate your valuable inputs.

Dynamic Table Creation

$
0
0
I want to do slightly different. I have CSV file with 20+ columns and I want to create a table in DB before using the output. It must be some way to generate DDL for all header values and type (int, varchar, date) based on CSV data. Looks like ETL metadata injection is a right step here, but I'm failed to figure out from the tutorial how to use it. Do someone have working kettle transformation or job example? Thank you!

Decrypt with PGP

$
0
0
I have a job set up that decrypts some gpg files using Pentaho and gpg4win, everything works just fine at first until it runs overnight as part of a batch process. It seems to forget the passphrase or PDI does not send the passphrase from the step. The passphrase is hard coded in the decrypt step but the job will still fail overnight.

After a failure I can open PDI and run the job and a pop up for the passphrase appears, after I enter it the job will run fine.

Am I overlooking some configuration or missing something? There is very little documentation on the PGP steps.

Populating a Fact Table

$
0
0
I'm very new to using pentaho/ any kinds of ETL development. Please forgive me if I'm not clear at explaining what I am looking for.

I've created a small data warehouse to house a set of surveys gathered annually through Qualtrics for several years.

The data warehouse consists of 5 tables:
dim_date
year_id
year

dim_institution
institution_id
institution_name

dim_Program
program_id
program_name

dim_question
question_id
question_name

fact_response
year_id
institution_id
program_id
question_id
response

I've populated each dimension tables but I don't know how to populate the fact table.
I guess I'm not sure how to transform the csv containing all of the responses into something useable for my fact table. Does this question make sense? Sorry if this is a confusing question or I am asking it wrong. If there is anything you can point me to to read I would much appreciate it.

pentaho bi server v7 password policy

Handle cookies in Rest Client?

$
0
0
Hi,

is there some way I can handle cookies in Rest Client (save and post)?
Rest Client does not have any option to manage them?

Regards

delete first space

$
0
0
Hi,

i want to delete the first space for all cells in one column.

The data typ of the column is string. I am using "sting operation". here i select: trim type = left and remove special character = space.
But with this settings all spaces will be deleted. i just want so delete the first space.

Thx :-)
Viewing all 16689 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>