I designed steps in Pentaho in such a way where it produced first output file for certain number records and second output file for another records. Instead of producing two output files, I'd like to combine them into one output file. Maybe write them into memory, and sort, then write to a file? I am aware of the blocking step and prioritize stream steps but those steps do not sort data. Is that correct? The fields in each records have column names and types that match. Is there a model where it shows some kind of concatentation of the column names and types, sorting of the data, before writing to one file?
↧