Madhukara Phatak

Big data consultant and trainer
Bangalore, India [email protected]

Big data developer passionate about building products and services to improve people's life. Open source enthusiast.

Experience

Bigdata consultant at Data Mantra, Aug, 2014 - Current

  • Consult on Hadoop, Spark and Ecosystem projects

Bigdata Developer and Lead at Zinnia Systems, July,2013- Aug, 2014

  • Worked on Hadoop,Spark,BA products
  • Developed machine learning application on Spark

Team Lead - Product framework team at Zinnia Systems, July,2012- July, 2013

  • Lead the team to develop in house framework to build BSS/OSS products
  • HTML/CSS/JS and J2EE
  • API design and release management

Architect on Hadoop at Virtusa,Hyderabad, Jan,2013- July, 2013

  • Develop overall architecture and technically helping developers to choose API's

Intern at HP, Feb,2010- April, 2010

  • Worked on porting tools from Linux to HP operating system

Projects

Spark consulting to Decision Mapper USA, April 2015 - Current

    Working on
  • Building Spark and Spark SQL based platform for financial analytics
  • Building automated docker based deployment infrastructure for Hadoop and Spark
  • Data unification across RDBMS, Parquet, Hive sources
  • Scaling financial aggregation and analytics on large clusters
  • Interactive tooling with Zeppelin

Mobile payment consulting on Spark and Scala for JusPay India , April 2015

    Worked on
  • Building real time event processing system based on spark streaming to handle real time payment information
  • Designing over all architecture for data modeling

Twitter sentiment analysis on Spark at Zinnia Systems

  • Building machine learning based models for twitter sentiment analysis on spark

Bigdata Architecture at Virtusa,Hyderabad

  • Lead a team of 30+ developers to deliver a Hadoop based project . Mainly involved in architecting the solution around hadoop and hadoop ecosystem technologies

Nectar - Open source predictive modeling framework

Framework development

  • Lead a team to develop in house framework for BSS/OSS product
  • HTML/JS/CSS and J2EE based framework
  • Advanced JS development
  • Git based development and jenkins release management

Training

Trained more than 2000 people on Hadoo,Spark and ecosystem projects. The following are few trainings done by me. You can find some of videos of my training on youtube.

Apache Spark Developer Training at SpringPeople, May 2015

  • 2 days developer training
  • Spark, Spark Streaming and Spark SQL
  • Real world use case project
  • 4 people batch

Apache Spark Developer Training at JusPay India, April 2015

  • 5 days developer training
  • Spark, Spark Streaming and Spark SQL
  • Real world use case project
  • 8 people batch

Advanced Hadoop Data training at Genpact, Bangalore, July 2014

  • 5 days developer training
  • Advanced topics like YARN, Hadoop Federation, Spark
  • Map/Reduce, HDFS, Hive, Pig Hands on
  • Real world use case project
  • 20 people batch

Hadoop Data scientist at Motorola, Bangalore, April 2013

  • 3 days data scientist training
  • Map/Reduce, HDFS, Hive, Pig Hands on
  • 15 people batch

Hadoop Developer training at Motorola, Bangalore, March 2013

  • 5 days developer training
  • Map/Reduce, HDFS, Hive, Pig Hands on
  • 15 people batch

Other trainings

  • 5 days Hadoop Developer training at ITC, Bangalore, Jan 2013
  • Hadoop Developer training at Genpact, Bangalore, Dec 2012
  • Hadoop Data scientist training at CityBank, Bangalore, Oct 2012
  • Hadoop Developer public trainings at Idea labs
  • Hadoop Developer training at Wipro, Bangalore, Jan 2012
  • Hadoop Developer training at Virtusa, Hyderabad, Dec 2011

Open source contribution

Hadoop

Other projects

  • Macroid
  • Open source Projects

    Publication

    On Cloud Computing Deployment Architecture

    IEEE paper on new way of looking at cloud deployments which allow higher flexibility and accelerated development.

    Distributed Computing in Business Analytics

    White paper on new way of looking at cloud deployments which allow higher flexibility and accelerated development.

    Talks

    Skills

    Big data skills
    Hadoop, HDFS, Map/Reduce, Spark, Hive, Pig, Sqoop, Hbase, Zookeeper, Shark, MLLib, YARN
    Languages
    Java, Scala, JavaScript, C, C++
    Java frameworks
    JSP, Servlet, Hibernate
    Architecture
    Apache Tuscany Web service, REST, MEAN stack
    Mobile
    Android development
    Front end
    Prototype.js, Bootstrap, JQuery
    Databases
    Mysql, Oracle
    Development tools
    Git, Jenkins, Eclipse, Intellij