I am using PDI 5.0.2 and connected to a Hadoop v0.20.2 instance running on a VM through Virtual Box. The VM is running Ubuntu and Hadoop is in single node mode. When I try to copy a file from my local filesystem to HDFS on the VM, I keep getting the error:
2014/02/04 15:13:22 - Hadoop Copy Files - ERROR (version 5.0.2, build 1 from 2013-12-04_15-52-25 by buildguy) : File System Exception: Could not copy "file:///C:/pentaho/5.0.2/samples/weblogs_rebuild.txt" to "hdfs://test:***@10.0.0.30:54310/user/pdi/weblogs/raw/weblogs_rebuild.txt".
2014/02/04 15:13:22 - Hadoop Copy Files - ERROR (version 5.0.2, build 1 from 2013-12-04_15-52-25 by buildguy) : Caused by: Could not close the output stream for file "hdfs://test:***@10.0.0.30:54310/user/pdi/weblogs/raw/weblogs_rebuild.txt".
2014/02/04 15:13:22 - Hadoop Copy Files - ERROR (version 5.0.2, build 1 from 2013-12-04_15-52-25 by buildguy) : Caused by: Connection refused: no further information
I can connect to HDFS via the browse functionality in the Hadoop Copy Files step and can see all folders and files in HDFS.
Any suggestions on what to try?
2014/02/04 15:13:22 - Hadoop Copy Files - ERROR (version 5.0.2, build 1 from 2013-12-04_15-52-25 by buildguy) : File System Exception: Could not copy "file:///C:/pentaho/5.0.2/samples/weblogs_rebuild.txt" to "hdfs://test:***@10.0.0.30:54310/user/pdi/weblogs/raw/weblogs_rebuild.txt".
2014/02/04 15:13:22 - Hadoop Copy Files - ERROR (version 5.0.2, build 1 from 2013-12-04_15-52-25 by buildguy) : Caused by: Could not close the output stream for file "hdfs://test:***@10.0.0.30:54310/user/pdi/weblogs/raw/weblogs_rebuild.txt".
2014/02/04 15:13:22 - Hadoop Copy Files - ERROR (version 5.0.2, build 1 from 2013-12-04_15-52-25 by buildguy) : Caused by: Connection refused: no further information
I can connect to HDFS via the browse functionality in the Hadoop Copy Files step and can see all folders and files in HDFS.
Any suggestions on what to try?