Channel: Pentaho Community Forums

Calculator and Remove CRLF from string seems not to work as expected

I'm reading a file with CRLF in fields (coming from a JIRA database), and I'd like to remove the CRLF from the field (content). I'm using the Calculator step with what I believe is the appropriate calculation, but the results are strange. What am I doing wrong?

Field input (CRLF put in bold by me, for display purposes):
"Mark mentioned that this issue is not blocking for delivering 8.1.1 to Investec for phase one of the implementation. Therefore this is not a must have for 8.1.1 and we can deliver this in 8.2 or 8.3. I prefer to deliver it asap, so in 8.2 Will discuss with both teams if it can be delivered in 8.2-RC4.
CRLF
Added the Story"

Result field:
B@5c947618

Any help is appreciated. (Attached image: Remove CRLF.jpg)
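The intended operation can be sketched in a few lines of Python. This only illustrates the logic and is not the PDI Calculator step; the helper name `remove_crlf` and the shortened sample text are assumptions made for the example. A result such as B@5c947618 resembles the default string form of a Java byte array, which may mean the field arrives as binary rather than as text. That is only a guess from the output shown, so the sketch decodes bytes before touching the line breaks.

```python
def remove_crlf(value, encoding="utf-8", replacement=" "):
    """Return value as text with CR/LF line breaks replaced."""
    # If the field arrives as raw bytes, decode it before doing string work.
    if isinstance(value, (bytes, bytearray)):
        value = bytes(value).decode(encoding)
    # Normalise CRLF and lone CR to LF, then replace every LF.
    value = value.replace("\r\n", "\n").replace("\r", "\n")
    return value.replace("\n", replacement)


# Shortened, illustrative version of the field shown in the post.
sample = b"can be delivered in 8.2-RC4.\r\nAdded the Story"
print(remove_crlf(sample))
```

If the field really is binary in the stream, converting it to a String type before the Calculator step would be the first thing to try.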

Line Height Bug??

I am trying to adjust the line height of a text-field whose attribute "rich-text-type" I have set to "text/html". I believe there may be a bug in PRD, and I am wondering if someone can help me figure out whether I am doing something wrong or whether there is a bug and, if so, how to get it fixed. It is difficult to explain, but I will try my best.

This is what I have found. In PRD when I change the attribute "rich-text-type" from "text/plain" to "text/html" the styling no longer allows me to manipulate the font "family" and "font-size" using the style tab. This is expected behavior.

In order to change the font family and font-size I need to insert in the attribute "value" an expression. Since I want the font to be Times New Roman with a font-size of 10 and the field I am working with is called data1, I insert the following expression into the attribute "value".

="<html><body><span style=""font-family: 'Times New Roman'; font-size: 10pt; "">" & [data1] & " </span></body></html>"

Now my font family and font size come out as I want them (i.e. Times New Roman and font-size 10).

If I want to create more space between the lines, I can increase the style "line-height" (although if my text field is under an inline band, I need to change the "line-height" of the band). But I can only decrease the space between the lines to a certain amount. This is expected behavior, as the line-height cannot be smaller than the font. But when I go all the way down to "0" for "line-height", the space between lines with a relatively small font size like 10pt is larger than it should be. Again, this behavior occurs when the attribute "rich-text-type" is set to "text/html". It is even more dramatic with a smaller font size like 5pt; then it is very easy to see that the spacing between the lines is larger than it should be.

See this document, which shows reports from the different examples I explained above (sorry, I wasn't able to upload an attachment to the Pentaho forum site).

What I think is happening here is that when the attribute "rich-text-type" is set to "text/html", PRD picks a default font and font-size. Although one can later modify the font and font-size, PRD still won't allow the line spacing to be smaller than the default font-size. Therefore, if one sets the font-size smaller than the default, the line spacing will be too big.

So is this a bug? I really need to be able to reduce the line spacing to 0.

Please let me know if I can explain this any better. Thanks.

Double click report download

Hello,

We are trying to download a DoubleClick (DCM) report from the Google DDM reporting site. There is no predefined input step to access or connect to DCM reporting.

Could you please help if you have worked on this type of requirement? Thanks!

External Id for Salesforce Insert/Upsert

Hello,

I was working with Kettle and Salesforce, but I have problems when I use an external ID in Pentaho Data Integration 7.0. I don't know whether Pentaho has a bug here or whether I'm doing something wrong. When I load data into Salesforce, Kettle doesn't report any issue, but when I check the data, the external ID doesn't work.

Does anyone know anything about this problem?

Thank you!

10-fold-cross-validation in WEKA GUI => Bias because of sequence folding/vectorizing?

Hi all!
I have a question where I am unsure whether the WEKA implementation is misleading or I am mistaken.

Let’s take a simple example: I have 10k documents, 2k of them labeled as positive regarding the target value, 8k labeled as negative. Now, I preprocess the text with a different tool and generate an Input File for WEKA (.arff). If I hand over this file to the WEKA GUI and apply 10-fold-cross-validation with e.g. BayesNet, I get quite good results in terms of precision and recall (P: 71, R: 47). But I guess this is due to the fact that the process in the GUI is

Preprocessing (previously done by me) ==> Vectorizing (previously done by me) ==> Folding (done by WEKA [GUI]) ==> Training & Testing (x10) ⇒ Results [averaged]

when it should be in theory

Preprocessing ==> Folding ==> [Vectorizing ==> Training & Testing] (x10) ⇒ Results [averaged]

Otherwise my train set might contain information which it shouldn’t see before, because features of the test set are already included in the “vector” (and the other way round).

If I use my own implementation with own cross-validation and just use the Classifiers from WEKA via API, my results are far worse (P: 0.60, R: 0.22). I think this might be due to the sequence of the steps of Preprocessing, Vectorizing and Folding. Do I have the correct understanding of the process and might WEKA be misleading with its GUI suggesting it does unbiased 10-fold-cross-validation? Or is WEKA even smart enough to recognize this and “remove” the relevant features for the training?

Thanks, I really appreciate your comments!
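The second ordering described above (fold first, then vectorize inside each fold) can be sketched in plain Python. This is a minimal illustration under stated assumptions, not WEKA code: the function names, the whitespace tokenizer, and the bag-of-words representation are all hypothetical stand-ins for whatever vectorizer is actually used. The point is only that the vocabulary is built from the training part of each fold, so words that occur solely in the held-out documents never become features.

```python
import random
from collections import Counter


def k_folds(n_items, k, seed=0):
    """Shuffle indices once and deal them into k folds."""
    indices = list(range(n_items))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def build_vocabulary(docs):
    """Vocabulary = sorted set of tokens seen in the given documents only."""
    return sorted({token for doc in docs for token in doc.split()})


def vectorize(doc, vocabulary):
    """Bag-of-words counts; tokens outside the vocabulary are dropped."""
    counts = Counter(doc.split())
    return [counts[token] for token in vocabulary]


def leak_free_folds(docs, k, seed=0):
    """Yield (train_vectors, test_vectors, vocabulary) for each fold."""
    for test_idx in k_folds(len(docs), k, seed):
        held_out = set(test_idx)
        train_docs = [d for i, d in enumerate(docs) if i not in held_out]
        test_docs = [docs[i] for i in test_idx]
        # The vocabulary is fitted on the training part of this fold only.
        vocabulary = build_vocabulary(train_docs)
        yield ([vectorize(d, vocabulary) for d in train_docs],
               [vectorize(d, vocabulary) for d in test_docs],
               vocabulary)
```

Within WEKA itself, the usual way to get this ordering is, as far as I know, to leave the text as a string attribute in the .arff file and wrap the vectorizing filter together with the classifier in a FilteredClassifier, so that the filter is rebuilt on each training fold.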

Pentaho 7.0 and Community Text Editor plugin

Hi all,
I've tried installing this plugin from the marketplace, as I have it installed and I'm using it in Pentaho 6.1, but I have had to uninstall it in 7.0 because I'm not able to start the server after installing it. I get this error when I try to start Pentaho Server after installing the plugin:

Code:

ERROR [Logger] misc-class org.pentaho.platform.plugin.services.pluginmgr.PentahoSystemPluginManager: PluginManager.ERROR_0011 - Failed to register plugin cte
java.lang.NoClassDefFoundError: org/springframework/security/GrantedAuthority
    at java.lang.Class.getDeclaredMethods0(Native Method)
    at java.lang.Class.privateGetDeclaredMethods(Class.java:2701)
    at java.lang.Class.getDeclaredMethods(Class.java:1975)
    at org.springframework.util.ReflectionUtils.getDeclaredMethods(ReflectionUtils.java:612)
    at org.springframework.util.ReflectionUtils.doWithMethods(ReflectionUtils.java:524)
    at org.springframework.util.ReflectionUtils.doWithMethods(ReflectionUtils.java:510)
    at org.springframework.beans.factory.annotation.AutowiredAnnotationBeanPostProcessor.determineCandidateConstructors(AutowiredAnnotationBeanPostProcessor.java:243)
    at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.determineConstructorsFromBeanPostProcessors(AbstractAutowireCapableBeanFactory.java:1069)
    at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.createBeanInstance(AbstractAutowireCapableBeanFactory.java:1042)
    at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.doCreateBean(AbstractAutowireCapableBeanFactory.java:510)
    at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.createBean(AbstractAutowireCapableBeanFactory.java:482)
    at org.springframework.beans.factory.support.AbstractBeanFactory$1.getObject(AbstractBeanFactory.java:306)
    at org.springframework.beans.factory.support.DefaultSingletonBeanRegistry.getSingleton(DefaultSingletonBeanRegistry.java:230)
    at org.springframework.beans.factory.support.AbstractBeanFactory.doGetBean(AbstractBeanFactory.java:302)
    at org.springframework.beans.factory.support.AbstractBeanFactory.getBean(AbstractBeanFactory.java:197)
    at org.springframework.beans.factory.support.DefaultListableBeanFactory.preInstantiateSingletons(DefaultListableBeanFactory.java:776)
    at org.springframework.context.support.AbstractApplicationContext.finishBeanFactoryInitialization(AbstractApplicationContext.java:861)
    at org.springframework.context.support.AbstractApplicationContext.refresh(AbstractApplicationContext.java:541)
    at org.pentaho.platform.plugin.services.pluginmgr.PentahoSystemPluginManager.reload(PentahoSystemPluginManager.java:286)
    at org.pentaho.platform.plugin.services.pluginmgr.PentahoSystemPluginManager.reload(PentahoSystemPluginManager.java:176)
    at org.pentaho.platform.plugin.services.pluginmgr.PluginAdapter.startup(PluginAdapter.java:40)
    at org.pentaho.platform.engine.core.system.PentahoSystem$2.call(PentahoSystem.java:442)
    at org.pentaho.platform.engine.core.system.PentahoSystem$2.call(PentahoSystem.java:433)
    at org.pentaho.platform.engine.core.system.PentahoSystem.runAsSystem(PentahoSystem.java:412)
    at org.pentaho.platform.engine.core.system.PentahoSystem.notifySystemListenersOfStartup(PentahoSystem.java:433)
    at org.pentaho.platform.engine.core.system.PentahoSystem.access$000(PentahoSystem.java:83)
    at org.pentaho.platform.engine.core.system.PentahoSystem$1.call(PentahoSystem.java:364)
    at org.pentaho.platform.engine.core.system.PentahoSystem$1.call(PentahoSystem.java:361)
    at org.pentaho.platform.engine.core.system.PentahoSystem.runAsSystem(PentahoSystem.java:412)
    at org.pentaho.platform.engine.core.system.PentahoSystem.notifySystemListenersOfStartup(PentahoSystem.java:361)
    at org.pentaho.platform.engine.core.system.PentahoSystem.init(PentahoSystem.java:331)
    at org.pentaho.platform.engine.core.system.PentahoSystem.init(PentahoSystem.java:227)
    at org.pentaho.platform.web.http.context.SolutionContextListener.contextInitialized(SolutionContextListener.java:162)
    at org.apache.catalina.core.StandardContext.listenerStart(StandardContext.java:4853)
    at org.apache.catalina.core.StandardContext.startInternal(StandardContext.java:5314)
    at org.apache.catalina.util.LifecycleBase.start(LifecycleBase.java:145)
    at org.apache.catalina.core.ContainerBase.addChildInternal(ContainerBase.java:725)
    at org.apache.catalina.core.ContainerBase.addChild(ContainerBase.java:701)
    at org.apache.catalina.core.StandardHost.addChild(StandardHost.java:717)
    at org.apache.catalina.startup.HostConfig.deployDirectory(HostConfig.java:1092)
    at org.apache.catalina.startup.HostConfig$DeployDirectory.run(HostConfig.java:1834)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
    at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.ClassNotFoundException: org.springframework.security.GrantedAuthority
    at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
    at org.pentaho.platform.plugin.services.pluginmgr.PluginClassLoader.loadClass(PluginClassLoader.java:193)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
    ... 46 more

Has anyone encountered the same error, or does anyone have it working with their Pentaho 7.0 installation? Since the marketplace has a specific version for Pentaho 7.0, it seems that it normally works, and perhaps the problem lies with my installation.

My environment:
OS: Ubuntu 14.04 32 bits
Pentaho 7.0
JVM 1.8.0_111-8u111-b14-3~14.04.1-b14

Regards

How to integrate CHARINDEX and SUBSTRING from SQL Server onto a Field in PDI

All

I'd like to achieve the following:

In a field named Comment, I'd like to search for the starting position of a string (the field ProjectKey concatenated with a hyphen) and take the substring of the comment from that position on, with the length of another field (IssueKey). To complicate matters, the result should be excluded (NULL) where the ProjectKey (concatenated with a hyphen) is not found within the comment string.

In SQL Server I can create a function which does the trick, but I'd like to incorporate this within a PDI step:
CASE WHEN CHARINDEX(ProjectKey + '-', Comment) = 0 THEN NULL ELSE SUBSTRING(Comment, CHARINDEX(ProjectKey + '-', Comment), { fn LENGTH(IssueKey) }) END AS DuplicateKey

How to do this?

Thanks in advance
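The logic of the SQL expression above can be written out as a small Python function, which may help when porting it to a scripting step. This is a sketch of the logic only, not a PDI step; the function name `duplicate_key` and the sample values are made up for illustration. Note that Python's `str.find` is case-sensitive, whereas CHARINDEX follows the column collation, so results could differ on mixed-case data.

```python
def duplicate_key(comment, project_key, issue_key):
    """Mirror of CASE WHEN CHARINDEX(...) = 0 THEN NULL ELSE SUBSTRING(...) END."""
    # Any NULL input yields NULL, as it would in SQL.
    if comment is None or project_key is None or issue_key is None:
        return None
    # str.find is 0-based and returns -1 when absent (CHARINDEX returns 0).
    start = comment.find(project_key + "-")
    if start == -1:
        return None
    # Take as many characters as IssueKey is long, starting at the match.
    return comment[start:start + len(issue_key)]


# Hypothetical sample values.
print(duplicate_key("Duplicate of ABC-123, see notes", "ABC", "ABC-456"))
print(duplicate_key("No key in this comment", "ABC", "ABC-456"))
```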

CRUD application

I am trying to create a CRUD application + dashboards using Pentaho BI Server.

What is the best way to do that?

So far, I'm considering an integration between CUBA Platform and Pentaho.

The main problem is in the user and access control integration.

Thanks in advance

Regex in Get File Name

Hi folks,
I am trying to get a list of all the names of csv files in a folder so that I can write some regex to pull the specific files I want every day for manipulation. Pretty simple. For testing I have a transformation with only the Get File Names step in it. It pulls the folder name and details when only the "File/Directory" is populated, but once I add something into the "Wildcard (RegExp)" field, the transformation sits in an "Idle" status and doesn't run.

I am trying to use .*\.csv as the regex in this field to pull names of all the csv files in this folder. From what I can tell I'm doing this correctly, but when I use the regex, I get "Dispatching started for transformation" in the log, and the whole thing just sits there in "Idle" status. Take the regex out, and it works OK but doesn't give me what I want. Put it back in, and I'm waiting around for nothing.

I've tried restarting, creating a new transformation, passing the filepath and regex from a previous step, and changing up the regex, but it's all the same. Is there another way to do what I'm trying to do, or am I doing something incorrect?
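As a sanity check on the pattern itself, the following Python sketch applies .*\.csv to a few file names. It assumes the whole file name must match the pattern (which is how I would expect a wildcard field of this kind to behave, though that is an assumption about the step), and the file names are invented for the example.

```python
import re

# The pattern from the post, compiled once.
CSV_PATTERN = re.compile(r".*\.csv")


def select_files(names, pattern=CSV_PATTERN):
    """Keep only names that match the pattern in full."""
    # fullmatch requires the entire file name to match, not just a prefix.
    return [name for name in names if pattern.fullmatch(name)]


# Hypothetical directory listing.
names = ["sales_2016.csv", "notes.txt", "archive.csv.bak", "DATA.CSV"]
print(select_files(names))
```

The pattern selects only the lower-case .csv name here, so the regex looks fine on its own; the idle behaviour is more likely related to the folder contents or the step configuration than to the expression. If upper-case extensions occur, a case-insensitive variant would be needed.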

Pentaho 7.0 and CSV Exports from Viewer

I'm having a particular problem. I had any number of reports which I brought over from 5.4 to 7.0. They all run (save for a few issues with a beanscript datasource) and they export in every format but one: csv.
They will even export csv in PRD 7, but in viewer on the BA Server -- no such luck.

Now, the Steelwheels example "Buyer Report (sparkline report)" still exports. I've tried migrating my reports, changing the compat level (Buyer Report is still 3.8.0), and several other things.

It is confounding, and I was wondering if anyone has had the same problem. My reports are mostly CDA based, but I created a report in PRD 7 with a JDBC datasource and it ran fine, except on the server with a CSV export.

Any feedback or help would be appreciated. Even just a verification on a server instance which isn't mine.

TIA!

Does PDI support MariaDB as a connection type?

Does PDI support MariaDB as a connection type?

If not, can I create a customized database connection type?

Thank you.

Does Pentaho support Data Virtualization? If yes, then how?

Hi All,
I want to know how to achieve data virtualization in Pentaho.
Please help.

Thanks in advance
Ajinkya Gaikwad

Core Java Example To Upload Data From CSV file to MYSQL using pentaho API

Hi,

I am new to Pentaho and am not very familiar with the Pentaho API.

I want to use their API in my project to upload data from a CSV file to a MySQL database using the main method of a plain Java class.

Please suggest a possible way to do this, along with the list of jar files that are needed and a link from which I can download them.

If possible, please post a sample program for the same.

Looking forward to your kind support.

Pentaho Data Integration Http Post

Hi:

I have a data extraction job which uses the HTTP POST step to hit a website to extract data. The site becomes unresponsive after a couple of hits and the program stops. Is there a way to make the job retry a couple of times if it doesn't get a 200 response on the first hit? (Bearing in mind that all the steps in a transformation run in parallel and that the input should not be skipped while retries are made.) Any help would be much appreciated. Thanks in advance.
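The retry behaviour asked about can be sketched generically as follows. This is a minimal Python illustration of the control flow, not a PDI job: `post_with_retries`, its parameters, and the idea of passing the actual request in as a callable named `send` are all assumptions made for the example. Each input is retried in place before the next one is sent, so no input is skipped.

```python
import time


def post_with_retries(send, payload, max_attempts=3, base_delay=1.0,
                      sleep=time.sleep):
    """Call send(payload) until it returns status 200 or attempts run out.

    send is any callable that performs the request and returns the HTTP
    status code. Returns (last_status, attempts_used).
    """
    status = None
    for attempt in range(1, max_attempts + 1):
        try:
            status = send(payload)
        except OSError:
            # A connection-level failure counts as a failed attempt.
            status = None
        if status == 200:
            return status, attempt
        if attempt < max_attempts:
            # Exponential back-off: base_delay, then 2x, 4x, ...
            sleep(base_delay * 2 ** (attempt - 1))
    return status, max_attempts
```

In a PDI job, a comparable effect is usually built as a loop around the transformation (for example with a counter variable and a wait between attempts), processing one input at a time; how that is wired up depends on the job design.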

CCC Bar Chart - MultiChart : Having a different axis title?

Hi,

Is it possible to have a different axis title for each small chart of a multi bar chart?
I want to display a set of indicators in a multichart; some are percentages, some amounts, some quantities, and I would like to adjust the title of the vertical axis accordingly.

Thank you

Benoit

random seed and ibk

Hello,

I am using weka to analyze a data set I have.
The question I have is: when I use IBK and specify a value for k, if I change the random seed the results are different for each random seed value I use. Shouldn't it be the same since I did not change the value of k?

Thank you and appreciate your help,

M
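One possible explanation, assuming the evaluation uses cross-validation or a percentage split: the random seed controls how the instances are shuffled before they are split, so each seed trains and tests IBk on different subsets even though k is unchanged. The Python sketch below only illustrates that effect on the fold assignment; it is not WEKA code, and the function name `cv_folds` is hypothetical.

```python
import random


def cv_folds(n_instances, n_folds, seed):
    """Shuffle instance indices with the given seed and deal them into folds."""
    indices = list(range(n_instances))
    random.Random(seed).shuffle(indices)
    return [sorted(indices[i::n_folds]) for i in range(n_folds)]


# Same data, same number of folds, different seeds: different test sets.
print(cv_folds(10, 5, seed=1))
print(cv_folds(10, 5, seed=2))
```

With the same seed the folds are identical, so results are reproducible; with a different seed the test sets change, and the reported accuracy changes with them.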

GROUP BY doesn't work properly

Hi,
I have a:

SORT_TRASF->GROUPBY->UNIQUE_ROWS

but the Group by step does not seem to work correctly:

SORT:
Fieldname                     Ascending  Case sensitive compare?  Presorted?
cdc_served_imei_enel_u        Y          N                        N
cdc_served_msisdn_u           Y          N                        N
dta_fine_validita_u           Y          N                        N
fk_plmn_u                     Y          N                        N
cdc_served_imsi_u             Y          N                        N
fln_presenza_enel_u           Y          N                        N
fln_presenza_ti_billing_u     Y          N                        N
fln_validita_u                Y          N                        N
fk_audit_inserimento_var_u    Y          N                        N
fk_audit_aggiornamento_var_u  Y          N                        N
pk_time_u                     Y          N                        N

GROUP BY:
Group field
dta_fine_validita_u
fk_plmn_u
cdc_served_imsi_u
fln_presenza_enel_u
fln_presenza_ti_billing_u
fln_validita_u
fk_audit_inserimento_var_u
fk_audit_aggiornamento_var_u
pk_time_u

Aggregates:
Name                        Subject                 Type         Value
cdc_served_imei_enel_first  cdc_served_imei_enel_u  First value
cdc_served_msisdn_first     cdc_served_msisdn_u     First value

but the aggregate cdc_served_imei_enel_first (First value of cdc_served_imei_enel_u) does not work. In fact, after this step I get two rows with identical group fields (Preview mode):

dta_fine_validita_u fk_plmn_u cdc_served_imsi_u fln_presenza_enel_u fln_presenza_ti_billing_u fln_validita_u fk_audit_inserimento_var_u fk_audit_aggiornamento_var_u pk_time_u cdc_served_imei_enel_first cdc_served_msisdn_first
2999/12/31 00:00:00 9 222420000001073 1 0 1 20161214154500 20161214154500 482400 220645840014324 393211001073
2999/12/31 00:00:00 9 222420000001073 1 0 1 20161214154500 20161214154500 482400 013214008445060 393211001073

Where is my error?
thanks
Regards
Giusepp
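One thing worth checking, offered as a possibility rather than a diagnosis: the Sort step above orders rows by cdc_served_imei_enel_u and cdc_served_msisdn_u first, and those are not group fields. As far as I know, the Group by step only merges consecutive rows with equal group fields, so rows of the same group that are not adjacent produce separate output rows. Python's `itertools.groupby` behaves the same way, which makes the effect easy to reproduce. In the sketch below the rows are reduced to two fields, and the middle row is a made-up value added to separate the two real ones.

```python
from itertools import groupby
from operator import itemgetter


def first_value_per_group(rows, group_key, value_key):
    """Group consecutive rows with equal group_key and keep the first value."""
    return [(key, next(group)[value_key])
            for key, group in groupby(rows, key=itemgetter(group_key))]


# Rows ordered by imei first, as in the Sort step configuration.
rows = [
    {"imei": "013214008445060", "imsi": "222420000001073"},
    {"imei": "100000000000000", "imsi": "222420000009999"},  # hypothetical row
    {"imei": "220645840014324", "imsi": "222420000001073"},
]

# Grouping on imsi while sorted by imei: the same imsi appears twice.
print(first_value_per_group(rows, "imsi", "imei"))

# Sorting on the group field first yields one row per imsi.
by_group = sorted(rows, key=itemgetter("imsi", "imei"))
print(first_value_per_group(by_group, "imsi", "imei"))
```

If this is the cause, putting the group fields first in the Sort step (with the aggregated fields after them), or using the Memory Group by step, should give one row per group.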

Popup on NewMapComponent

I am using Pentaho 6.1 CE and a NewMapComponent with marker data. I need to create a popup that shows the latitude and longitude of the clicked marker, but there is no marker click function on the new component...
Any help?



Regards

Error Unable to get fields from previous steps because of an error

When I run my transformation I get this error:

org.pentaho.di.trans.steps.groupby.GroupByMeta@1b3294c - ERROR (version 7.0.0.0-25, build 1 from 2016-11-05 15.35.36 by buildguy) : Unable to get fields from previous steps because of an error

The run itself goes fine, but why do I get this error in the console? What does it mean?
As input I have one Table Input step with a PostgreSQL query that uses two variables, ${VAR1} and ${VAR2}. Could this be the problem? To me everything looks OK.

thanks