Quantcast
Channel: Pentaho Community Forums
Viewing all 16689 articles
Browse latest View live

Count columns in a row

$
0
0
Hi!!
I would like to know if there is a way to know the number of columns in a row. I am importing some data from a file (it corresponds to a matrix that will have different sizes each time) and I would like to be able to know how many columns have been imported.

Thanks and kind regards,
Alain

cdf dashboard Error on first login

$
0
0
Hi All,

I have designed cdf dashboard and called the url web services in my application.whenever i access the first session of browser ,

i am getting Syntax error and dash dashboard is not defined then refreshed same url its working fine.please help on this how to debug my code

i have attached the my error screenshot

error.jpg
Attached Images

Overview and description of all Carte Web Services?

Load a text file in reverse order

$
0
0
Can we load the rows in a text file in reverse order? I want to load the last 200K rows.

Print directly to printer

$
0
0
Hi all... is there any possibility to print a report with a xaction directly to printer without preview of the document?
Thanks to all...

Open Business Analytics Training in London #BI #BigData #ETL #OLAP

$
0
0
Training Main page Training Dates:  From 28th April to 1st May 2014 Duration: 24 hours. 4 days Location: Executive offices group meeting rooms. London. Address: Central Court, 25 Southampton Bldgs – WC2A 1AL . Training contents: DAY 1 Business Intelligence Open Source Introduction … Continue reading →

More...

Switch between Development and Production Environment

$
0
0
I would like to be able to switch between a development (testing) environment and a production environment without changing my scripts. I tried this procedure:
  1. copying all my production database connection files (.kdb) to a directory outside of my Kettle Repository
  2. changing all the database connections inside my Kettle Repository to point to the development databases
  3. copying these development database connection files to a second directory outside of my Kettle Repository
  4. Creating two shell scripts / batch files to copy the either the development connections or the production connections to my Kettle Repository


This didn't work. Even though all my database connections are shared, not all are in .kdb files, and it appears that at least one copy of all database connections are inside each and every transformation (.ktr).

So, I'd like to ask how everyone else is switching between development and production scripts.

Some database connection information stored outside of Kettle Repository

$
0
0
Some database connection information is evidently stored outside of the Kettle Repository. This makes things very difficult to work in a development group, since checking the Kettle Repository into Subversion (or whatever) will not save and restore all of the required information for a project.

Is there a workaround? Should I submit a bug?

Creating a simple hyperlink

$
0
0
Hello, everybody

I am new to Pentaho reporting, so my apologies if answer to this question might seem obvious .
I need to create a simple static hyperlink in an HTML report footer. All documentation points to url property of an element, but for some reason it does not work for me.
I add a message field to the report footer and set its url property to some string and run the report as HTML. I can see field content, but it does not have <a> tag around it.
I also tried to use rich-text-type property set to text/html and enter link html manually, but it does not work either. Resulting report replaces actual hyperlink with formatting which imitates it (blue font, underlined). Search through these forums and user manual did not give any hints. I am sure such a simple thing is possible to do, but I am missing something.

Thank you for your time.

Rita Chupalov

Improvements to Optiq's MongoDB adapter

$
0
0
It’s been a while since I posted to this blog, but I haven’t been idle. Quite the opposite; I’ve been so busy writing code that I haven’t had time to write blog posts. A few months ago I joined Hortonworks, and I’ve been improving Optiq on several fronts, including several releases, adding a cost-based optimizer to Hive and some other initiatives to make Hadoop faster and smarter.

More about those other initiatives shortly. But Optiq’s mission is to improve access to all data, so here I want to talk about improvements to how Optiq accesses data in MongoDB. Optiq can now translate SQL queries to extremely efficient operations inside MongoDB.

MongoDB 2.2 introduced the aggregation framework, which allows you to compose queries as pipelines of operations. They have basically implemented relational algebra, and we wanted to take advantage of this.

As the following table shows, most of those operations map onto Optiq’s relational operators. We can exploit that fact to push SQL query logic down into MongoDB.

MongoDB operator Optiq operator
$project ProjectRel
$match FilterRel
$limit SortRel.limit
$skip SortRel.offset
$unwind -
$group AggregateRel
$sort SortRel
$geoNear -

In the previous iteration of Optiq’s MongoDB adapter, we could push down project, filter and sort operators as $project, $match and $sort. A bug pointed out that it would be more efficient if we evaluated $match before $project. As I fixed that bug yesterday, I decided to push down limit and offset operations. (In Optiq, these are just attributes of a SortRel; a SortRel sorting on 0 columns can be created if you wish to apply limit or offset without sorting.)

That went well, so I decided to go for the prize: pushing down aggregations. This is a big performance win because the output of a GROUP BY query is often a lot smaller than its input. It is much more efficient for MongoDB aggregate the data in memory, returning a small result, than to return a large amount of raw data to be aggregated by Optiq.

Now queries involving SELECT, FROM, WHERE, GROUP BY, HAVING, ORDER BY, OFFSET, FETCH (or LIMIT if you prefer the PostgreSQL-style syntax), not to mention sub-queries, can be evaluated in MongoDB. (JOIN, UNION, INTERSECT, MINUS cannot be pushed down because MongoDB does not support those relational operators; Optiq will still evaluate those queries, pushing down as much as it can.)

Let's see some examples of push-down in action.

Given the query:
SELECT state, COUNT(*) AS c
FROM zips
GROUP BY state
Optiq evaluates:
db.zips.aggregate(
{$project: {STATE: '$state'}},
{$group: {_id: '$STATE', C: {$sum: 1}}},
{$project: {STATE: '$_id', C: '$C'}})
and returns
STATE=WV; C=659
STATE=WA; C=484
...
Now let’s add a HAVING clause to find out which states have more than 1,500 zip codes:
SELECT state, COUNT(*) AS c
FROM zips
GROUP BY state
HAVING COUNT(*) > 1500
Optiq adds a $match operator to the previous query's pipeline:
db.zips.aggregate(
{$project: {STATE: '$state'}},
{$group: {_id: '$STATE', C: {$sum: 1}}},
{$project: {STATE: '$_id', C: ‘$C'}},
{$match: {C: {$gt: 1500}}})
and returns
STATE=NY; C=1596
STATE=TX; C=1676
STATE=CA; C=1523
Now the pièce de résistance. The following query finds the top 5 states in terms of number of cities (and remember that each city can have many zip-codes).
SELECT state, COUNT(DISTINCT city) AS cdc
FROM zips
GROUP BY state
ORDER BY cdc DESC
LIMIT 5
COUNT(DISTINCT x) is difficult to implement because it requires the data to be aggregated twice — once to compute the set of distinct values, and once to count them within each group. For this reason, MongoDB doesn’t implement distinct aggregations. But Optiq translates the query into a pipeline with two $group operators. For good measure, we throw in ORDER BY and LIMIT clauses.

The result is an awe-inspiring pipeline that includes two $group operators (implementing the two phases of aggregation for distinct-count), and finishes with $sort and $limit.

db.zips.aggregate(
{$project: {STATE: '$state', CITY: '$city'}},
{$group: {_id: {STATE: '$STATE', CITY: '$CITY'}}},
{$project: {_id: 0, STATE: '$_id.STATE', CITY: '$_id.CITY'}},
{$group: {_id: '$STATE', CDC: {$sum: {$cond: [ {$eq: ['CITY', null]}, 0, 1]}}}},
{$project: {STATE: '$_id', CDC: '$CDC'}},
{$sort: {CDC: -1}}, {$limit: 5})
I had to jump through some hoops to get this far, because MongoDB’s expression language can be baroque. In one case I had to generate
{$ifNull: [null, 0]}
in order to include the constant 0 in a $project operator. And I was foiled by MongoDB bug SERVER-4589 when trying to access the values inside the zips table's loc column, which contains (latitude, longitude) pairs represented as an array.

In conclusion, Optiq on MongoDB now does a lot of really smart stuff. It can evaluate any SQL query, and push down a lot of that evaluation to be executed efficiently inside MongoDB.

I encourage you to download Optiq and try running some sophisticated SQL queries (including those generated by the OLAP engine I authored, Mondrian).

More...

Running two Pentaho Instances in the same machine pointing to a single MYSQL Repos

$
0
0
Hi All,

I have installed Pentaho 4.1 in my server configured with Mysql repository. Is it possible to install another instance of Pentaho i.e Pentaho 5 in the same server and make them both running at the same time making use of the same mysql repository that is being used by Penatho 4.1 .

Doable ?

Thanks in Advance:D

Nested Expressions Issue.

$
0
0
Hi,
In Pentaho Reporting tool i have been trying to implement a few features using expressions. When i tried to use the following nested expression, i get an problem in execution.

=IF(CONCATENATE(FIXED(((["Column1"]/["Column2"])*100);2);" %")>0;"True";"False")

The problem is:

The IF statement is taking only "CONCATENATE(FIXED(((["Column1"]/["Column2"])*100" as the logic and since there is a semi-colon(;) after that, it is taking "2" as the first result for IF and the remaining part as the Otherwise Result. But "CONCATENATE(FIXED(((["Column1"]/["Column2"])*100);2);" %")>0" completely is the logic and
"True";"False" are the results. I hope i am clear enough in what i actually want. Please help me with this issue.

Is it my mistake in the code or is it an inbuilt error. Please help me out with this.

Thanks in advance.
Ritesh.

How to limit Multi Select Component???

$
0
0
Dear all,

I have a multi select component on my dashboard.
The multi select component can select multiple items.
But I would like to limit it. I would like to select only 5 item.

I have find the setting but I cannot find yet.

If anyone know about this, please let me know.

Thank you all.....

[Saiku] measuresCaption not working

$
0
0
Hello,

I am working with an i18n schema and all captions are working fine except measuresCaption. It works on jpivot but not on saiku (other captions are working fine on saiku):
Code:

<Schema name="mySchema" measuresCaption="Medidas">
Saiku:saiku.pngJpivot:jpivot.png
Any ideas on what's going on?
Regards.
Attached Images

mySQL Connection

$
0
0
Hi,

I'm trying to connect to a mySQL connection, I installed mySQL cluster and mysql-connector-odbc-5.2.6-win32.msi but surely I miss something (driver maybe? I don't know where and what to download)
When I try to create a connection I get the error message attached.

Thanks so much
Attached Images

How to use Data Source created using CSV file on CDE based dashboard?

$
0
0
I have created a new data source using the CSV file in Pentaho 5.0.
I am able to use this data source on Analyzer Report & Interactive Report, but how can we use the same data source on the CDE dashboard to bind the CTools CCC charts.

Thanks!

join logging tables

$
0
0
Hello all!

I guess the best way of logging information in Kettle is to use the standard logging tables and -routines (so, the ones that can be found and turned on in the properties of both transformations and jobs). I now turned them on for jobs, job entries and transformations and get some nice logging in these 3 tables, so all works quite nicely.

Now my question is: are there any "foreign keys" between these tables? Obviously no FK is created as a DB object but what I mean is more like: can I join these tables and if so, on which columns? E.g. if I want to see all transformations that were run during execution of a certain job. Strangely, I couldn't google this information. Someone must have had the same problem before?!?

On another note, I would be interested in adding some information to these tables, e.g. some of the stuff that I can get through the "Get System Info"-step (e.g. command line arguments). Would you recommend that and how would you implement it?

Thanks for your help!
/D

How to format a number in a dashboard?

$
0
0
Hi guys,
can somebody tell how to format a number in a Dashboard. A kind of function like number_format() of PHP. I think Javascript doesn't have this function. help please.

thanks.

Web Development | Web Designing | Web Hosting | SEO Services

$
0
0
Ave Infosys is a leading professional in Web Designing Company in Hyderabad India for the E-Business Industry.Ave Infosys are providing Best Website Development and Design Services in Hyderabad.Our company offers the Best Web Design & Development services, Web Hosting services. We have intensive web design and web skills merging with the quality essence of expertise should have component to help you to ascertain your internet presence or take it to the next level.we are the Best Web Design Company in Hyderabad.

For More Details :

Please contact: (+91) 40 40275321

Email : info@aveinfosys.com

web : http://aveinfosys.com/

Need to update the OLAP table on new entry automatically

$
0
0
Hi,

Please help me to update my OLAP table automatically on new record
entry in the datawarehouse by certain time intreval.
Viewing all 16689 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>