分享

SparkStreaming获取hdfs数据问题

jttsai 发表于 2014-12-19 17:08:34 [显示全部楼层] 只看大图 回帖奖励 阅读模式 关闭右栏 4 49039
本帖最后由 pig2 于 2014-12-19 18:30 编辑

如下图所示,在hdfs上的jtt目录下,是有aa,bb两个文件,里面也是有数据的
下面还有我些的代码,最后则为执行的命令
不知道问题出在那里,执行命令后:
14/12/19 16:50:10 INFO SendingConnection: Connected to [slave1.hadoop/20.26.19.18:34424], 1 messages pending
14/12/19 16:50:10 INFO BlockManagerInfo: Added broadcast_3_piece0 in memory on slave1.hadoop:34424 (size: 1353.0 B, free: 265.0 MB)
14/12/19 16:50:10 INFO MapOutputTrackerMasterActor: Asked to send map output locations for shuffle 1 to sparkExecutor@slave1.hadoop:60896
14/12/19 16:50:10 INFO TaskSetManager: Finished task 0.0 in stage 6.0 (TID 3) in 681 ms on slave1.hadoop (1/1)
14/12/19 16:50:10 INFO TaskSchedulerImpl: Removed TaskSet 6.0, whose tasks have all completed, from pool
14/12/19 16:50:10 INFO DAGScheduler: Stage 6 (take at DStream.scala:608) finished in 0.691 s
14/12/19 16:50:10 INFO SparkContext: Job finished: take at DStream.scala:608, took 0.747572378 s
-------------------------------------------
Time: 1418979010000 ms
-------------------------------------------

14/12/19 16:50:10 INFO JobScheduler: Finished job streaming job 1418979010000 ms.0 from job set of time 1418979010000 ms
14/12/19 16:50:10 INFO JobScheduler: Total delay: 0.943 s for time 1418979010000 ms (execution: 0.923 s)
14/12/19 16:50:10 INFO ShuffledRDD: Removing RDD 4 from persistence list
14/12/19 16:50:10 INFO BlockManager: Removing RDD 4
14/12/19 16:50:10 INFO MappedRDD: Removing RDD 3 from persistence list
14/12/19 16:50:10 INFO BlockManager: Removing RDD 3
14/12/19 16:50:10 INFO FlatMappedRDD: Removing RDD 2 from persistence list
14/12/19 16:50:10 INFO BlockManager: Removing RDD 2
14/12/19 16:50:10 INFO MappedRDD: Removing RDD 1 from persistence list
14/12/19 16:50:10 INFO BlockManager: Removing RDD 1

最后的命令

最后的命令



写的代码

写的代码



hdfs目录,是有数据的

hdfs目录,是有数据的







已有(4)人评论

跳转到指定楼层
muyannian 发表于 2014-12-19 18:49:28
建议楼主把问题描述清楚些
回复

使用道具 举报

exinbiti 发表于 2015-7-29 11:45:38
楼主问题解决了吗?
回复

使用道具 举报

xmhxmhxmh 发表于 2016-5-10 13:42:32
楼主问题解决了吗,我也遇到了同样的问题,sparkstreaming 读取hdfs文件时,报找不到文件的错误?
回复

使用道具 举报

wx_RYClUEop 发表于 2017-2-21 19:32:04
楼主最后解决了吗,这个问题
回复

使用道具 举报

您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

关闭

推荐上一条 /2 下一条