Did I miss anything in the following query? I am not getting greater than 10000 orders.
ordersmap = ordersRDD.filter(lambda x: int(x.split(",")[2] > ‘10000’) and x.split(",")[3]==‘COMPLETE’)
output:
1,2016-07-29 23:40:47.0,11599,COMPLETE
3,2013-07-25 00:00:00.0,12111,COMPLETE
5,2013-07-25 00:00:00.0,11318,COMPLETE
6,2013-07-25 00:00:00.0,7130,COMPLETE
7,2013-07-25 00:00:00.0,4530,COMPLETE