CURRICULUM VITAE Saikumar Allaka Email id:saikumar.allaka2011@gmail.com Mobile: 8125513816 Professional Summary Joined in Tech Mahindra on Nov 30, 2010. Have 3.3 years of experience in various technologies. Technically have strong/sound knowledge in developing Map Reduce programs, Pig. From the past 1 year I am working on Business Analytics projects. I have acquired good knowledge in R Lang, Text and Data Mining, Big Data and Machine Learning Techniques. Worked on Game Loyalty Program project to identify loyal customers using R as the primary tool and JGR for Visualization. Worked on Stocks Portfolio Optimization to minimize risk and get good returns. Worked on Big data projects using Map Reduce, Pig, and RHadoop. Currently I am working on Recommender systems. Have 10 months of project experience on Charge@OnceMediate (CMD) NOKIA SEIMENS NETWORKS Java Mediation tool 8 months of experience in Citibank project migration of data from Anacomp to CMOD, acquired skill on Content Management Tool CMOD and UNIX shell scripting. Around 8 months of experience in python Scripting language. Have 6 months of experience in content management. Around 1 year of experience in Hadoop ecosystem. Served clients in the telecom, banking and financial services. Good knowledge on Basic Unix Commands, Core Java. Good knowledge in big data concepts and have in-depth understanding of big data use case scenarios. Good understanding of HDFS, Map Reduce. Have experience in writing HQL queries. Have good experience in Map Reduce using python and Java. Good knowledge in PIG, HIVE and SQOOP. Strong debugging and problem solving skills. Self-motivated and able to work independently and as a member of a team. Excellent communication, interpersonal and analytical skills. Quick learner and adaptive to new and challenging technological environments. Ability to prioritize, multi-task, work under pressure, and deliver on time. Skill Profile – Technical Programming Languages Core Java, R Scripting Languages Basic Unix Shell Scripting, Python Framework Hadoop Big Data technologies Hadoop, HDFS, Hive, Pig, MapReduce, sqoop, RHadoop. Visualization tools Tableau, JGR, R Numerical computation tools Octave, R Operating System Linux, CentOS, Windows xp, vista and 7. Web Technologies& Scripting Languages Python, JSP and Servlets. RDBMS Oracle IDE / Tools & Utilities Eclipse, Net Beans, R Studio, JGR. Machine Learning Techniques K Means Clustering, Kernel K Means Clustering, Naïve Bayes Classifier, Linear Regression, Logistic regression, Decision Trees, Random Forests,Neural Networks Association Rules, Market Basket Analysis. Predictive Analytics Time Series Analysis, Linear Regression, ARIMA. Optimization and Decision Analysis Linear Optimization, Quadratic Optimization, Genetic Optimization, Goal Programming, Data Envelopment Analysis. Certifications: Big Data Analytics - Demos Hadoop and Amazon Cloud Hadoop Fundamentals -1 Data Science Computing For Data Analysis Qualifications: Bachelor of Technology, 2006-2010 Board of Intermediate Education, 2004-2006 Secondary School Education, 2003-2004 VNR VJIET, JNTU ECE 67.9% Nalanda Junior College. (M.P.C) 94.5% Pragathi Vidya Niketan -- 84.3% Professional Experience: Client Project Role Period INSOFE Game Loyalty Program Team Member 6 months Description: Identify most loyal customer for the gaming company based on its customer’s details and their previous game preferences. Roles and Responsibilities: Use the customers demographic and their past game preferences to discover relationship between the know characteristics of a customer to their reaction to the loyalty program assuming that customers demographic and game play preferences would help us define loyal customers. Environment: R Lang, JGR, Tableau Client Project Role Period INSOFE Stocks Portfolio Optimization Team Member 6 months Description: To identify the best portfolio from the available stocks by minimizing risks and get best returns. Roles and Responsibilities: Compute return, risks and create graphs for each stock. Use linear and quadratic libraries to compute the best portfolios for a given goal. Code a genetic algorithm from ground up for selecting portfolio with least VAR, least risk and highest return. Environment: R Lang Client Project Role Period Tech Mahindra Internal Crime Incident Analysis Team Member 10 days Description: Analyze crime incidents that happened in the city of San Francisco in the last 3 months Roles and Responsibilities: To find relative frequencies of different types of crime incidents, Crime occurrence frequency as a function of day of the week, Crime occurrence frequency as a function of hour of the day, Regression model for predicting the category (& occurrence) of crime based on the remaining attributes. Environment: Hive , sqoop, RHadoop. Client Project Role Period Tech Mahindra Internal Big Data – Hadoop Competency Team Member 8 Months Description: The objective of the project is to learn, experiment and analyze the data collected from State Bank of India and NSN Roles and Responsibilities: I have processed Big Data using Map Reduce frame work, Pig and calculated the transactions based on the region. In NSN project, daily around 10 GB of data is generated and which is stored on hadoop cluster and analyze the data and perform health check up for all the servers present in different locations of Canada. Environment: Hadoop, Ubuntu Linux , pig and python. Client Project Role Period Tech Mahindra Internal – Data Mining Twitter Sentiment Analysis Team Member 20 days Description: access the twitter Application Programming Interface (API) using python. Estimate the public's perception (the sentiment) of a particular term or phrase. Roles and Responsibilities: Analyze the relationship between location and mood based on a sample of twitter data. Environment: Python MapReduce framework Client Project Role Period Amazon - Predictive Analytics Predict an employee’s access needs, given his/her job role. Team Member 2 months Description: The objective of this project is to build a model, learned using historical data that will determine an employee's access needs, such that manual access transactions (grants and revokes) are minimized as the employee's attributes changeover time. Roles and Responsibilities: The model should take an employee's role information and a resource code and will return whether or not access should be granted. Environment: Python Client Project Role Period Tech Mahindra Internal click stream data Analysis Team Member 40 days Description: http://www.usa.gov/About/developer-resources/1usagov.shtml describes click stream data obtained from US government web sites. Store this data on Hadoop and analyze top 10 most popular sites in terms of clicks, top 10 most popular sites in each country, top 10 most popular sites for each month. Roles and Responsibilities: Analyze top 10 most visited websites and visualize it. Environment: Python, Java MapReduce framework, pig, R. Awards &Achievements Got 1st rank in Amazon Ninja Code competition held by Amazon with 2000+ participants. Got 128th rank in Quantium Hackit! 2013Competition held by Quantium with 1000+ participants. Good performance in competitions held in Kaggle by Amazon and yelp. Got appreciation from Program Manager for completing the given task 1 month ahead. Personal Details: Date of Birth June 29, 1989 Marital Status Single Nationality INDIAN Current Location Hyderabad Languages known English, Hindi, Telugu.