How do I remove duplicates from a spark RDD based on specific columns
566,26,adidas Men’s Germany Black/Red Away Match Soc,90.0,http://images.acmesports.sports/adidas+Men’s+Germany+Black%2FRed+Away+Match+Soccer+Jersey
569,26,adidas Men’s Germany Home Soccer Jersey,90.0,http://images.acmesports.sports/adidas+Men’s+Germany+Home+Soccer+Jersey
560,26,adidas Men’s 2014 MLS All-Star Game Replica B,85.0,http://images.acmesports.sports/adidas+Men’s+2014+MLS+All-Star+Game+Replica+Black+Jersey
565,26,adidas Youth Germany Black/Red Away Match Soc,70.0,http://images.acmesports.sports/adidas+Youth+Germany+Black%2FRed+Away+Match+Soccer+Jersey
549,26,Lotto Men’s Zhero Gravity V 700 TF Soccer Cle,59.99,http://images.acmesports.sports/Lotto+Men’s+Zhero+Gravity+V+700+TF+Soccer+Cleat
551,26,Lotto Men’s Zhero Gravity V 700 TF Soccer Cle,59.99,http://images.acmesports.sports/Lotto+Men’s+Zhero+Gravity+V+700+TF+Soccer+Cleat
552,26,Lotto Men’s Zhero Gravity V 700 TF Soccer Cle,59.99,http://images.acmesports.sports/Lotto+Men’s+Zhero+Gravity+V+700+TF+Soccer+Cleat
I want to remove row 6 and row 7 based on all columns other than the first.