Parse Log Output from the Hadoop YARN MapReduce DFSIO Benchmark Utility to CSV Files
Recently I've been spending my time running lots of DFSIO benchmark jobs.
The DFSIO utility is part of the Hadoop distribution and can be found in the JARs located in ./hadoop/share/mapred – the JARs have names like "hadoop-mapreduce-client-jobclient-*-tests.jar".
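As a rough illustration of locating that JAR and assembling a benchmark invocation, here is a minimal Python sketch. The helper names `find_dfsio_jar` and `build_write_command` are hypothetical, and the `TestDFSIO` class name and the `-write`, `-nrFiles` and `-size` options are assumptions about the Hadoop version in use; check them against the usage text your own JAR prints before relying on them.

```python
import glob
import os


def find_dfsio_jar(mapred_dir):
    """Return the first jobclient tests JAR found in mapred_dir, or None."""
    pattern = os.path.join(
        mapred_dir, "hadoop-mapreduce-client-jobclient-*-tests.jar"
    )
    matches = sorted(glob.glob(pattern))
    return matches[0] if matches else None


def build_write_command(jar_path, nr_files, size):
    """Build an argument list for a DFSIO write run (option names are assumed)."""
    return [
        "hadoop", "jar", jar_path, "TestDFSIO",
        "-write", "-nrFiles", str(nr_files), "-size", size,
    ]


if __name__ == "__main__":
    # Directory taken from the text above; adjust to your installation.
    jar = find_dfsio_jar("./hadoop/share/mapred")
    if jar is None:
        print("No jobclient tests JAR found")
    else:
        print(" ".join(build_write_command(jar, 10, "100MB")))
```

The script only prints the command rather than running it, so it can be inspected before anything is launched on the cluster.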
Output from the tool is a