Last edited by pig2 on 2014-12-19 18:30
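As the screenshot below shows, the jtt directory on HDFS contains two files, aa and bb, and both of them contain data.
The code I wrote is further down, followed by the command I ran.
I can't tell where the problem is. After running the command, this is the output: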
14/12/19 16:50:10 INFO SendingConnection: Connected to [slave1.hadoop/20.26.19.18:34424], 1 messages pending
14/12/19 16:50:10 INFO BlockManagerInfo: Added broadcast_3_piece0 in memory on slave1.hadoop:34424 (size: 1353.0 B, free: 265.0 MB)
14/12/19 16:50:10 INFO MapOutputTrackerMasterActor: Asked to send map output locations for shuffle 1 to sparkExecutor@slave1.hadoop:60896
14/12/19 16:50:10 INFO TaskSetManager: Finished task 0.0 in stage 6.0 (TID 3) in 681 ms on slave1.hadoop (1/1)
14/12/19 16:50:10 INFO TaskSchedulerImpl: Removed TaskSet 6.0, whose tasks have all completed, from pool
14/12/19 16:50:10 INFO DAGScheduler: Stage 6 (take at DStream.scala:608) finished in 0.691 s
14/12/19 16:50:10 INFO SparkContext: Job finished: take at DStream.scala:608, took 0.747572378 s
-------------------------------------------
Time: 1418979010000 ms
-------------------------------------------
14/12/19 16:50:10 INFO JobScheduler: Finished job streaming job 1418979010000 ms.0 from job set of time 1418979010000 ms
14/12/19 16:50:10 INFO JobScheduler: Total delay: 0.943 s for time 1418979010000 ms (execution: 0.923 s)
14/12/19 16:50:10 INFO ShuffledRDD: Removing RDD 4 from persistence list
14/12/19 16:50:10 INFO BlockManager: Removing RDD 4
14/12/19 16:50:10 INFO MappedRDD: Removing RDD 3 from persistence list
14/12/19 16:50:10 INFO BlockManager: Removing RDD 3
14/12/19 16:50:10 INFO FlatMappedRDD: Removing RDD 2 from persistence list
14/12/19 16:50:10 INFO BlockManager: Removing RDD 2
14/12/19 16:50:10 INFO MappedRDD: Removing RDD 1 from persistence list
14/12/19 16:50:10 INFO BlockManager: Removing RDD 1
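For anyone reading the log above: the `Time: 1418979010000 ms` block is the per-batch header, and any records for that batch would appear right after it, before the next log line. Below is a rough Python sketch for decoding that header and counting the records printed under it. The helper names (`batch_time_utc`, `records_per_batch`) are made up for illustration and are not part of Spark, and the sketch assumes every log line starts with the `yy/MM/dd HH:mm:ss LEVEL` prefix seen in this post.

```python
import re
from datetime import datetime, timezone

# Assumption: log lines start with "yy/MM/dd HH:mm:ss LEVEL ", as in the post.
LOG_LINE = re.compile(r"^\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2} [A-Z]+ ")
BATCH_HEADER = re.compile(r"^Time: (\d+) ms$")
SEPARATOR = re.compile(r"^-+$")


def batch_time_utc(header: str) -> datetime:
    """Convert a 'Time: <epoch millis> ms' header into a UTC datetime."""
    match = BATCH_HEADER.match(header.strip())
    if match is None:
        raise ValueError(f"not a batch header: {header!r}")
    return datetime.fromtimestamp(int(match.group(1)) / 1000, tz=timezone.utc)


def records_per_batch(lines):
    """Map each batch time (epoch millis) to the records printed under it."""
    batches = {}
    current = None
    for raw in lines:
        line = raw.strip()
        header = BATCH_HEADER.match(line)
        if header:
            current = int(header.group(1))
            batches[current] = []
        elif LOG_LINE.match(line):
            # The first log line after a header ends that batch's record block.
            current = None
        elif current is not None and line and not SEPARATOR.match(line):
            batches[current].append(line)
    return batches


# A few lines copied from the log in this post.
sample = [
    "14/12/19 16:50:10 INFO SparkContext: Job finished: take at DStream.scala:608, took 0.747572378 s",
    "-------------------------------------------",
    "Time: 1418979010000 ms",
    "-------------------------------------------",
    "14/12/19 16:50:10 INFO JobScheduler: Finished job streaming job 1418979010000 ms.0 from job set of time 1418979010000 ms",
]

print(batch_time_utc("Time: 1418979010000 ms").isoformat())  # 2014-12-19T08:50:10+00:00
print(records_per_batch(sample))  # {1418979010000: []}
```

The decoded time is 08:50:10 UTC, which lines up with the 16:50:10 log timestamps if the cluster clock is on UTC+8 (an assumption, the post does not say). The empty list means that no records were printed for this batch in the pasted output.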
Screenshot: the command I ran
Screenshot: the code I wrote
Screenshot: the HDFS directory, which does contain data