As Data Engineer at MapR Technologies, Mathieu Dumoulin works a lot with Hadoop (MapR’s distribution), Apache Spark, Apache Drill, Elasticsearch/Kibana, and Kafka/MapR Streams for real-time event-driven processing. The MapR Converged Data Platform integrates the power of Hadoop and Spark with global event streaming, real-time database capabilities, and enterprise storage for developing and running innovative data applications.
Dumoulin started using Hadoop in 2012 while at the Fujitsu Innovation Lab, where he implemented a complete Hadoop-based text classification pipeline with Mahout. Since then, he’s moved to Tokyo where his love of all things data has led him to log time as search engineer, data scientist, and data engineer. These days, his main interests are in real-time IoT applications, large scale Complex Event Processing, and distributed Deep Learning.