Madhukara Phatak
Big data consultant and trainer
Bangalore, India
Big data developer passionate about building products and services that improve people's lives. Open source enthusiast.
Experience
Big data consultant at Data Mantra, Aug 2014 - Current
- Consult on Hadoop, Spark and Ecosystem projects
Big data Developer and Lead at Zinnia Systems, July 2013 - Aug 2014
- Worked on Hadoop, Spark and BA products
- Developed a machine learning application on Spark
Team Lead - Product framework team at Zinnia Systems, July 2012 - July 2013
- Led the team developing an in-house framework for building BSS/OSS products
- HTML/CSS/JS and J2EE
- API design and release management
Architect on Hadoop at Virtusa, Hyderabad, Jan 2013 - July 2013
- Developed the overall architecture and helped developers choose APIs
Intern at HP, Feb 2010 - April 2010
- Worked on porting tools from Linux to HP operating system
Projects
Spark consulting to Decision Mapper USA, April 2015 - Current
Working on
- Building a Spark and Spark SQL based platform for financial analytics
- Building automated Docker-based deployment infrastructure for Hadoop and Spark
- Data unification across RDBMS, Parquet, Hive sources
- Scaling financial aggregation and analytics on large clusters
- Interactive tooling with Zeppelin
Mobile payment consulting on Spark and Scala for JusPay India, April 2015
Worked on
- Building a real-time event processing system based on
Spark Streaming to handle real-time payment information
- Designing the overall architecture for data modeling
Twitter sentiment analysis on Spark at Zinnia Systems
- Building machine learning based models for Twitter sentiment analysis on Spark
Big data Architecture at Virtusa, Hyderabad
- Led a team of 30+ developers to deliver a Hadoop-based project. Mainly involved in
architecting the solution around Hadoop and Hadoop ecosystem technologies
Nectar - Open source predictive modeling framework
Framework development
- Led a team developing an in-house framework for a BSS/OSS product
- HTML/JS/CSS and J2EE based framework
- Advanced JS development
- Git-based development and Jenkins release management
Training
Trained more than 2000 people on Hadoop, Spark and ecosystem projects. The following are a few of the trainings I have conducted. Videos of some of my trainings are available on YouTube.
Apache Spark Developer Training at SpringPeople, May 2015
- 2 days developer training
- Spark, Spark Streaming and Spark SQL
- Real world use case project
- 4 people batch
Apache Spark Developer Training at JusPay India, April 2015
- 5 days developer training
- Spark, Spark Streaming and Spark SQL
- Real world use case project
- 8 people batch
Advanced Hadoop Data training at Genpact, Bangalore, July 2014
- 5 days developer training
- Advanced topics like YARN, Hadoop Federation, Spark
- Map/Reduce, HDFS, Hive and Pig hands-on
- Real world use case project
- 20 people batch
Hadoop Data scientist at Motorola, Bangalore, April 2013
- 3 days data scientist training
- Map/Reduce, HDFS, Hive and Pig hands-on
- 15 people batch
Hadoop Developer training at Motorola, Bangalore, March 2013
- 5 days developer training
- Map/Reduce, HDFS, Hive and Pig hands-on
- 15 people batch
Other trainings
- 5 days Hadoop Developer training at ITC, Bangalore, Jan 2013
- Hadoop Developer training at Genpact, Bangalore, Dec 2012
- Hadoop Data scientist training at CityBank, Bangalore, Oct 2012
- Hadoop Developer public trainings at Idea labs
- Hadoop Developer training at Wipro, Bangalore, Jan 2012
- Hadoop Developer training at Virtusa, Hyderabad, Dec 2011
Open source contribution
Hadoop
Other projects
Macroid
Open source Projects
Publication
On Cloud Computing Deployment Architecture
IEEE paper on a new way of looking at cloud deployments that allows higher flexibility
and accelerated development.
Distributed Computing in Business Analytics
White paper on a new way of looking at cloud deployments that allows higher flexibility
and accelerated development.
Talks
Skills
- Big data skills
- Hadoop, HDFS, Map/Reduce, Spark, Hive, Pig, Sqoop, HBase, ZooKeeper, Shark, MLlib, YARN
- Languages
- Java, Scala, JavaScript, C, C++
- Java frameworks
- JSP, Servlet, Hibernate
- Architecture
- Apache Tuscany Web service, REST, MEAN stack
- Mobile
- Android development
- Front end
- Prototype.js, Bootstrap, jQuery
- Databases
- MySQL, Oracle
- Development tools
- Git, Jenkins, Eclipse, IntelliJ