Hi,
I've run into an issue and am wondering whether anyone has ideas about the cause. I've designed a job that, at a high level, does the following:
1. Connects to an FTP server.
2. Downloads a file.
3. Performs a transformation on the downloaded file.
When I run the job locally via the Pentaho Data Integration program on my Windows 7 PC, it runs successfully.
However, we have a web application that runs under JBOSS 7.1 on a Linux server, and one section of that web application runs jobs using the Pentaho Java classes (e.g. org.pentaho.di.job.Job). When I copy the .kjb and .ktr files from my PC to the JBOSS/Linux server and run the same job through the Pentaho Java classes, it appears to hang at some point during the FTP communication. The Pentaho Java classes write the following entries to our application log:
2013-05-07 15:27:42,011 INFO [org.pentaho.di] Main - Start of job execution
2013-05-07 15:27:42,034 INFO [org.pentaho.di] Main - Starting entry [CCLGetFtpFile]
2013-05-07 15:27:42,095 INFO [org.pentaho.di] GetFtpFile - Starting entry [Delete file]
2013-05-07 15:27:42,114 INFO [org.pentaho.di] Delete file - File [${BasePath}/<Path to File>] already deleted.
2013-05-07 15:27:42,115 INFO [org.pentaho.di] GetFtpFile - Starting entry [Get a file with FTP]
2013-05-07 15:27:42,125 INFO [org.pentaho.di.job.entries.ftp.JobEntryFTP] Started FTP job to <FTP Server>
2013-05-07 15:27:42,307 DEBUG [org.pentaho.di.job.entries.ftp.MVSFileParser] Ignoring the set locale.
It appears to hang after the log entry by org.pentaho.di.job.entries.ftp.MVSFileParser.
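To pin down where the job thread is actually blocked, one option is to capture a thread dump while it is hung, either with the JDK's jstack tool against the JBOSS process or from inside the application. Below is a minimal sketch of the in-process variant using only the standard library; the class name ThreadDumpSketch is hypothetical and not part of Pentaho. A thread stuck in a socket read or accept inside the FTP client code would suggest the FTP data connection is what is stalling.

```java
import java.util.Map;

public class ThreadDumpSketch {

    // Builds a plain-text dump of every live thread: name, state, and stack frames.
    static String dump() {
        StringBuilder sb = new StringBuilder();
        for (Map.Entry<Thread, StackTraceElement[]> e : Thread.getAllStackTraces().entrySet()) {
            Thread t = e.getKey();
            sb.append('"').append(t.getName()).append("\" state=").append(t.getState()).append('\n');
            for (StackTraceElement frame : e.getValue()) {
                sb.append("    at ").append(frame).append('\n');
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // In the web application this would be written to the application log instead.
        System.out.print(dump());
    }
}
```

Calling dump() from another thread (for example a timer or an admin page) a minute or so after the job starts should show the frame where the FTP job entry is waiting.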
I can connect to the FTP server manually without problems using a command-line FTP client on the JBOSS/Linux server, so I don't believe there are any networking issues between the JBOSS/Linux server and the FTP server.
The one interesting thing I did notice is in the "netstat" output: when I connect to the FTP server manually from the command line on the JBOSS/Linux server, the connection is listed as tcp (IPv4). When the application runs the job via the Pentaho Java classes, the connection to the FTP server is listed as tcp6 (IPv6). I'm not sure what impact, if any, that has.
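As far as I understand, a JVM on Linux opens dual-stack sockets by default, so netstat can list a connection as tcp6 even when the peer is reached over an IPv4-mapped address. The standard system property java.net.preferIPv4Stack (set with -Djava.net.preferIPv4Stack=true at JVM startup) forces IPv4-only sockets. The sketch below is one way to check what the JVM sees for a host. The class name AddressFamilyCheck is hypothetical, and the default host "localhost" is only a placeholder for the real FTP server name.

```java
import java.net.Inet4Address;
import java.net.Inet6Address;
import java.net.InetAddress;
import java.net.UnknownHostException;

public class AddressFamilyCheck {

    // Classifies a resolved address by its Java type.
    static String family(InetAddress addr) {
        if (addr instanceof Inet4Address) return "IPv4";
        if (addr instanceof Inet6Address) return "IPv6";
        return "unknown";
    }

    public static void main(String[] args) throws UnknownHostException {
        // Placeholder host; pass the FTP server name as the first argument.
        String host = args.length > 0 ? args[0] : "localhost";

        // Prints null when the property has not been set on the command line.
        System.out.println("java.net.preferIPv4Stack="
                + System.getProperty("java.net.preferIPv4Stack"));

        // Lists every address the JVM resolves for the host, with its family.
        for (InetAddress addr : InetAddress.getAllByName(host)) {
            System.out.println(family(addr) + " " + addr.getHostAddress());
        }
    }
}
```

Running this with the same JVM options as JBOSS, once with and once without -Djava.net.preferIPv4Stack=true, would show whether the address family differs from what the command-line FTP client uses.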
Has anyone ever run into anything similar to this or have any ideas what might be happening?
Thanks,
Chris