layout md title Research Big Data Hadoop Spark Operator latencies Sources of overhead Data locality exploration Faster HDFS Integration Fault tolerance with MPI Heterogeneous Computing General purpose graphics processors computing (GPGPU) OpenMP Integration