Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. 14th January 2014 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. TABLE OF CONTENTS Infosys BigDataEdge™ - Introduction ......................................................................4 About Infosys BigDataEdge™ ..................................................................................................... 4 Infosys BigDataEdge™ - Features .............................................................................................. 4 Infosys BigDataEdge™ - Outcomes ............................................................................................ 5 Infosys BigDataEdge – Key capabilities .................................................................................... 6 Infosys Big Data™ Product Development, since March 2012 ................................................... 7 Leveraging Value from Infosys BigDataEdge™ ......................................................................... 8 About Infosys BigDataEdge™ Team, Methods and Solutions ................................................... 9 Infosys BigDataEdge™ - Business Strategy ............................................................9 Infosys present Business Strategy using BigDataEdge™ .......................................................... 9 Business Strategy / Actions performed by Infosys Executives with BigDataEdge™ .............. 10 Infosys Big Data Framework ................................................................................................... 11 Infosys BigDataEdge™ Adoption Enablers .............................................................................. 11 Infosys BigDataEdge™ Architecture ........................................................................................ 12 Infosys Capabilities................................................................................................................... 12 Infosys SocialEdge – Big Data Analytics Platform .................................................................. 13 Infosys Corporate Level Strategies .......................................................................................... 13 Infosys BigDataEdge™ - Technology Partner ..................................................... 14 Infosys BigDataEdge + Oracle ................................................................................................. 14 Infosys BigDataEdge™ + Oracle Technology Model (as of December 2013) ......................... 15 Infosys BigData™ Global Landscape ...................................................................... 16 Infosys Cloud & Big Data™ IT Infrastructure Management ................................................... 17 Infosys BigDataEdge™: Big Data projects executed in ‘Information Management CoE’ in Infosys labs located in Quincy, MA 02169, USA ....................................................................... 18 Infosys’ Hadoop Roadmap started since February 2011 ........................................................ 20 Infosys BigDataEdge™ in Banking and Finance vertical using Big Data and NoSQL technologies.............................................................................................................................. 20 Infosys BigDataEdge™ Key Projects from CA, USA ................................................................. 20 Infosys BigDataEdge™ and CRM Analytics projects served since 2012, from Infosys office located in Plano, Texas, USA .................................................................................................... 22 Infosys BigDataEdge™ use Microsoft’s internal Big Data Analysis platform called Cosmos in Log Retention Service............................................................................................................... 22 Infosys BigDataEdge™ use Oracle Exadata in CA, USA ........................................................... 23 Infosys BigDataEdge™ - Digital Marketing Strategy .............................................................. 24 Infosys BigDataEdge™ - Internal training program on Big Data is called ‘Infosys at the Cutting Edge’ ............................................................................................................................ 24 Infosys BigDataEdge™ - Customer Intelligence ................................................ 25 Partial list of customers using Infosys BigDataEdge™ ........................................................... 25 1 - Infosys BigDataEdge™ serves Wells Fargo Bank, USA using MongoDB in ‘FOREX Position Risk Management Application................................................................................................. 27 2 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. 2 - Infosys Technologies Ltd serves Bank of America (BOA) using BigDataEdge™ ............... 35 3 - Infosys BigDataEdge™ serves American Express (AMEX) ................................................. 37 4 - Infosys BigDataEdge™ serves SunTrust Banks Inc ............................................................ 38 5 - Infosys Technologies Ltd serves Apple Inc using BigDataEdge™ ...................................... 39 6 - Infosys BigDataEdge™ product is used in T-Mobile’s Big Data - Web Analytics .............. 43 7 - Infosys BigDataEdge™ product is used in GAP Inc for ‘Mainframe Re-hosting Program’ 44 8 - Infosys BigDataEdge™ - Strategic Insights Practice of Campaign Analytics for Ally Financial, Ally Bank ................................................................................................................. 45 9 - Infosys serves Sears Holdings Corporation using IBM Infosphere DataStage 8.0.1 and preparing PoC using Infosys BigDataEdge™ ........................................................................... 46 10 – Infosys BigDataEdge™ serves Deutsche Bank using ‘Data Analysis Application’ .......... 47 11 - Infosys performs ‘Investor Portfolio Analysis’ and ‘NAV Prediction of omnibus funds’ for DBS Bank Ltd ............................................................................................................................ 47 12 - Infosys serves its customers using Weblog Analytics ...................................................... 48 13 - Infosys BigDataEdge™ - Strategic Insights Practice of Web Analytics for a Legal Services Customer .................................................................................................................... 48 14 - Infosys BigData™ serves its customers from its Infosys Office in CA, USA in “Foreign Exchange Risk Management Banking Domain ....................................................................... 49 15 - Infosys BigDataEdge™ serves its customers in ‘Mortgage and Online Credit Applications’ ............................................................................................................................. 50 Infosys BigdataEdge™ – Global Services IT Outsourcing Intelligence..... 51 Infosys BigDataEdge™ serves Fidelity Investments Company in project ‘Sentiment Analysis’ ................................................................................................................................................... 51 Infosys BigDataEdge™ serves Walmart Stores using largest ‘Online Transformation Program’ ................................................................................................................................... 58 Infosys BigDataEdge™ - Cloud Computing COE ...................................................................... 59 Infosys BigDataEdge™ serves UL by performing Test Report Analysis .................................. 59 Infosys BigDataEdge™ serves AMP, USA by performing Customer Churn Analysis .............. 60 Infosys BigDataEdge™ serves T-Mobile in project ‘T-Mobile Big Data Analytics’ ................. 60 Infosys BigDataEdge™ serves Australian Bank using ‘Online Transformation Program’ .... 61 Infosys BigDataEdge™ - India Operations ........................................................... 61 Infosys BigDataEdge™ - Cloud and Big Data Practices from Pune, India .............................. 61 Quick facts ................................................................................................................................ 64 Contact Details ......................................................................................................................... 65 3 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys BigDataEdge™ - Introduction About Infosys BigDataEdge™ Infosys BigDataEdge™ focuses on enabling the enterprise to device in-sights by augmenting the in-house structured trusted data with un-structured data within and outside the enterprise in a real time. Infosys BigDataEdge™ enables enterprise to adopt Big Data Technologies seamlessly by providing them the benefits of standardization, Re-use, Accelerators and Framework to realize their Big Data Journey at 1/6th the Cost and with 8 times faster time to market. BigDataEdge™ provides a GUI approach to Cloud Computing. The data from internal sources (Example: local data from desktop) and external sources (Example: Facebook, Twitter, LinkedIn, Salesforce, etc) can be loaded to cloud cluster using its GUI interface after applying particular algorithms to it. BigDataEdge™ product enables user to execute a large number of algorithms in Hive, Cassandra, Mongodb, HDFS etc with simple drag and drop option and also creates visualization based on the result. Infosys BigDataEdge™ - Features Metadata driven Pre-built Adapters for Data ingestion from various sources Discover data using auto-clustering, ad-hoc query, search, entity extraction, relationship-mining Pre-built components for Stream Processing & Real Time Analytics Pre-built transformers for data transforming and cleaning Record Linkage, De-duplication components for data enrichment Comprehensive & easy to use Analytical & Machine Learning algorithms support Seamless Integration of Analytics Tools like R with Hadoop Graphical easy to use User Interface with drag and drop features for configuring data pipelines Industry leading Visualization techniques for deep insights Collaboration platform for collective decision making Integration with Enterprise systems for Real-Time Decision making 4 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Insight Governance and Management One-Click Cloud Deployment - Seamless Analytical Cluster Setup, Configuration and Full Featured Hub Management Infosys BigDataEdge™ - Outcomes Discover Batch Data Adapters Event Stream Adapters Search Entity extraction Ad-Hoc Query Auto clustering Process Generic Transformers Data Cleansing Data Masking Content Extractions Change Capture Unique ID Analyze Text Analytics Clustering Pattern Matching Classification Recommendations Time Series Analysis Regression Analysis 5 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Visualize Normal Visualizations Cluster Visualizations Geospatial Graphs Drill Down Collaboration Institutionalize Send Alert Adapter Integrate with Enterprise Workflow system Send Emails Adapter Integration Adapters with Enterprise systems Infosys BigDataEdge – Key capabilities Discover and Aggregate Real-time data discovery o 50 out-of-the-box connectors Extensible connector framework o Rapid addition of new information sources o Automates 80% of connector functions Transformation builder o Visual approach to build transformations o 70 reusable components Process: Meta-data based Virtual Data Source o Process at the source o Reduce Infrastructure cost by 30% Analyze: Insight Builder o 250 pre-built algorithms 6 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. o Drag and drop visual interface o Empower business analysts to build insights Visualize Insight Builder o 50 ready-to-use visualization options o Traceability of insights o Executive dashboard o Smart device access Operationalize Collaboration Wall o Cross-functional collaboration o Real-time decisions Enterprise workflow o Influence systems using pre-built connectors Uniqueness of Infosys BigDataEdge™: Agility: Actionable intelligence from new data in hours Speed: Expedite your processing deployment across all data types by 10X or more Business Value: Monetize your data through applied insights Infosys Big Data™ Product Development, since March 2012 Technology Modules in Infosys Big Data™ Product 1. Metadata driven Ingestion Framework 2. Metadata Management 3. Dynamic Validation Framework 4. Big Data Processing Accelerators 5. Machine Learning Processing Toolkits 6. Search and Indexing on Big Data 7. Text Analysis (including Identity Resolution) 8. On Demand Analysis and Dynamic Visualization Framework 9. Information Service Framework 10. Real-Time Analytics Framework 11. Automated Cluster Deployment 7 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. JDK / JRE – 1.6 Cloudera Hadoop 3.0 Map Reduce R RHIPE RHBASE AVRO Hive 3.0 Pig Eclipse JSON formats XML Output Formats HDFS Oozie RMR RHDFS SQL Tableau Infosys creates Ingestion, Transformation Components on BigDataEdge and developed MapReduce programs for various Input formats and algorithms. Infosys created various components using Map Reduce, Avro, Hive, Oozie and the analytics using R, R integration with Hadoop like RHDFS, RMR, RHipe and also set-up RStudio Server on Hadoop Cluster. Infosys created a single stop solution for all Big Data related requirements which is 8x faster. Infosys use Tableau for dynamic generation of graphs over JQuery that resulted in sizable time-cost benefits Leveraging Value from Infosys BigDataEdge™ Integrate data silos across the research organization to allow scientists easy access to the wealth of information available o Also the ability to capture and ingest data from multiple structured and unstructured data sources Develop and place flexible and broad high throughput analytical layers on the integrated data sets to mine the information o Continue to allow the researchers the flexibility of choosing a broad array of analytical tools that fit their research needs o Advanced near real time analytics over Big Data combining structured and unstructured data o Ability to process data on cloud architecture and cloud platforms Develop scalable and flexible infrastructure to support the increasing computing needs. o Bioinformatics increasingly needs Tera-Scale computing in the areas of drug discovery and drug repurposing. 8 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. About Infosys BigDataEdge™ Team, Methods and Solutions Talent: Number of Consultants: 8000+ Number of Visualization Experts: 100+ Data Scientists: 500+ Methodologies: Value Realization Method™ IMPACT™ Questions To Actions Infosys Solutions: [BigDataEdge™ + TradeEdge™ + InteractEdge + FINACLE™ + Infosys Labs + Technology COE’s + Alliances] Infosys BigDataEdge™ - Business Strategy Infosys present Business Strategy using BigDataEdge™ Infosys is deriving business value using BigDataEdge™ and performing business and technology transformation by leveraging BigDataEdge™. [Thoughts + Devices + Processes = BigDataEdge™] Big Data Value: Change the Business (Use Data) -> Run the Business (Produce Data) 1. Run the Business by organizing Data to do something specific (Produce Data) 2. Change the Business by taking Data As-Is to figure out what it can do (Use Data) Infosys BigDataEdge™ - Success Strategy Infosys BigDataEdge™ Data Architecture is saving over $250 Million for an enterprise customer in Retail business Infosys adopts State of the art architecture for Real-Time actionable insights, for its customers Infosys is helping its major retailing customer to get price competitive online 9 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Business Strategy / Actions performed by Infosys Executives with BigDataEdge™ During past 10 months, Infosys has perfectly marketed its BigDataEdge™ to all its customers. Infosys customers in BFSI have asked for PoC (Proof of Concept). o Priority 1: Currently, as of December 2013, Infosys is working on PoC (Proof of Concept) for all its existing customers in Banking and Finance vertical. o Priority 2: Infosys is focusing on close related ‘Competitors’ of existing Customers for whom the BigDataEdge™ is implemented during 2013 o Priority 3: Infosys is performing extensive research on Enterprise customers who are making Big Data Initiatives and are willing to use BigDataEdge™ during 2014. o Priority 4: Infosys is identifying Enterprise customers who are setting up CoE, Research labs, Innovation Centers, etc. Infosys is developing customized unique Big Data Strategy for each and every customer in Banking and Finance sector. Infosys is establishing and managing Big Data implementation for its customers using BigDataEdge™ Infosys is designing solutions based on a mix of Big Data and NoSQL technologies Hadoop, Vertica, MongoDb etc., for its customers in Banking and Finance vertical. Infosys BigDataEdge™ performs “Knowledge Based Solution” implementation for Failure Analysis organization to its customers. in Big Data How Infosys BigDataEdge™ achieved during 2013 and how it took advantage with Analytics-As-A-Service: During 2013, Infosys has taken advantage of customer’s non-expertise and timelessness to implement a large-scale big data strategy in-house. Customer’s lack of investments, timelessness, cost overruns, lost opportunities and failure risks have been taken advantage by Infosys BigDataEdge™ practice. During 2013, customers couldn’t realize business benefits of the entire project using Big Data. 10 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. During 2013, customers couldn’t partner with expert IT Vendor with expertise in Big Data analytics projects; or IT vendors who could take risks, low costs and timely big data implementation. Infosys Big Data Framework Infosys BigDataEdge™ Adoption Enablers I. II. III. IV. Accelerator – Solution & Expertise Services – Extreme Data Product – Voice of Customer Analytics Platform – Social Edge for Big Data 11 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys BigDataEdge™ Architecture Infosys Capabilities Global Delivery Model Training Capabilities Operational Efficiency Management/Leadership High Quality Standards 12 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys SocialEdge – Big Data Analytics Platform Infosys Corporate Level Strategies Core Strategies I. Global Delivery Model: Producing where it is most cost effective to produce & selling where it is most profitable to sell. II. Moving up the Value Chain: Getting involved in a software development project at the earliest stage of its life cycle. III. PSPD Model: “Predictability of Revenues, Sustainability of Revenues, Profitability, Derisking” for risk management. Generic Strategies I. II. Low cost Global delivery 24/7 Model Little differentiation in low-end services of value chain 13 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. III. High differentiation in high end services of value chain like software products and package solutions IV. Focus on quality, customer relationship management, timely-delivery Infosys BigDataEdge™ - Technology Partner Oracle is Technology Partner of Infosys BigDataEdge™ Infosys BigDataEdge + Oracle Value Proposition of BigDataEdge + Oracle i. 40% faster ingestion & Discovery ii. 8X faster insights iii. 250 prebuilt algorithms for rapid insights iv. Real time decisions 14 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys BigDataEdge™ + Oracle Technology Model (as of December 2013) 15 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys BigData™ Global Landscape Eclipse Netbeans JBOSS developer Red GATE Apache studio Big SDB Data Apache Jena - Fuseki Protege OpenRDF Putty Winscp Filezilla OpenNLP RestClient Firebug Postman Tomcat server JBOSS TomEE Hadoop Hive Pig Oozie Mahout Impala Zookeeper Storm - open source, big- Whirr data processing system HBase Cassandra Sqoop MongoDB Apache Mesos Rhipe Amazon s3 EC2 and EMR Cloud CDH instances (Amazon Elastic MapReduce (Amazon EMR)) Mapr HDP Apache Hadoop Intel® Distribution Apache Hadoop Cloudera Manager Ganglia BigDataEdge™ Flume for Rapid Miner Jena Exclusive Technology Landscape in Infosys Big Data™ Hadoop Hive Pig Oozie Sqoop Hbase Storm Linux Impala Java Ruby on Rails Jquery MongoDB Windows Linux JavaScript Jquery Backbone.js HTML CSS Apache Tomcat Nginx Web Server 16 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Inclusive: Technology Landscape in Infosys Big Data™ VMware player Virtual Box Jira MediaWiki Twiki Dokuwiki Wordpress Mantis Bug Tracker Visual SVN Apache Tomcat Apache httpd JBOSS server server server server Infosys BigDataEdge™ product development started: May 2012 Product Developed launched in Market: February 2013 Infosys Cloud & Big Data™ IT Infrastructure Management Infosys performs Big Data research using Core Java, Pig scripting, Hive queries, R scripting, Rhipe programming and MapReduce programming involving summarization patterns, Data Organization patterns, Filtering patterns, Join Patterns, Meta patterns. Infosys performs RDF data processing using SPARQL, Apache Jena SDB java API, Apache Jena Fuseki Java API, Text Processing Infosys performs content extraction using GATE & JAPE Infosys performs text processing using apache Open NLP Infosys performs MapReduce orchestration using oozie Infosys performs recommendation to its customers in using Mahout Algorithms, Encog Machine Learning, JSP, HTML and jQuery Infosys executes Linux shell scripting, Python scripting, Windows Shell scripting Infosys Big Data™ serves its customer in a project for creation of an automated car using Artificial Intelligence. Infosys Big Data™ R&D delivers one of its customers in Sentimental Analysis and Tag Clouding using MapReduce and also using Hive SerDe Infosys Big Data™ R&D performs analysis using Pig scripting and creates a Workflow using Oozie. Infosys Big Data™ serves its customers in following areas, o Social Media Analytics o Analyzing the reason for churn happened to a company. o Analyses system logs generated using snort using hadoop. 17 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. o Creation of new Hadoop Clusters o Creation of Virtual Machines for Hadoop Cluster and Servers. Infosys Big Data™ is deploying Hadoop and other BigData services in Cloud Infosys Big Data™ is monitoring Hadoop clusters using Cloudera Manager and Ganglia Infosys Big Data™ use Cloudera, Apache, Hortonworks, Intel and Mapr distributions of Hadoop. Infosys Big Data™ installed and configured the following technologies Hive Pig Impala HBase Cluster Storm Cluster Sqoop Mahout Oozie Zookeeper Cassandra Cluster Cloudera Manager Free Ganglia Hue Flume R Studio RMR R - - (Hive server 1 and 2) and Enterprise Edition Rhipe programm ing Infosys Big Data™ performs cluster creation using Whirr and performs Hadoop cluster testing using Tera-sort Infosys Big Data™ executes Open NLP – Natural Language programming and Encog Programming for Machine Learning Infosys BigDataEdge™: Big Data projects executed in ‘Information Management CoE’ in Infosys labs located in Quincy, MA 02169, USA ‘Infosys BigDataEdge™ - Information management CoE’ performs BigDataEdge™ Product Engineering; conceptualizing, designing and executing the development of the IP assets in the Advanced Analytics areas. Infosys BigDataEdge™ - Research operations performed in Infosys labs: ‘Information Management CoE’ in Infosys labs performs conceptualization and design of product features of BigDataEdge™ financial services offerings. Infosys labs are mapping the business use cases with technology features, evaluating the appropriate use of techniques and algorithms for their implementation. Infosys delivered use cases like, o Enterprise risk analytics in financial derivatives 18 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. o Social media compliance o Enhanced product classification leveraging textural product description Analytical Techniques used in BigDataEdge™ Infosys BigDataEdge™ use different analytical techniques deployed by ‘Information Management CoE’ in Infosys labs in Quincy, USA o Data mining o Unstructured Analytics – Natural Language Processing o Text Mining o Neural networks o Artificial Intelligence o Bayesian networks o Forecasting and Recommendation o Image processing Infosys deploys Advanced Analytics for its BigDataEdge™ offerings, after performing intense market research. Infosys prepares multiple solution designs, develops use cases, PoC and then validates techniques. Project Onsite Location: USA Project Offshore Location: Mysore, Pune and Bangalore Project Start: February 2013 Project End: As of December 2013, Infosys BigDataEdge™ implementation and delivery are in fast progress. C C++, UNIX Windows MATLAB R Software Linux Address of Information management CoE that performs BigDataEdge™ Product Engineering: Two Adams Place, Quincy, MA 02169; Phone:+1 781 356 3100; Fax:+1 781 356 3150 19 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys’ Hadoop Roadmap started since February 2011 Infosys has started to build Hadoop Team within the organization, since February 2011 Infosys is building the Hadoop Competency across the organization (since February 2011) by training them in Hadoop and its Eco system. o Framework: Hadoop ecosystem, Cascading o Development Methodology: Scrum/Sprint Infosys BigDataEdge™ in Banking and Finance vertical using Big Data and NoSQL technologies Infosys is currently executing a multi-year $10 million program (for BOA) to consolidate multiple applications in the BOA’s domain and re-platform to Hadoop / Vertica architecture to reduce cost and improve performance. Infosys has recently executed a $10 million program (for Apple) to build a high performing common reconciliation platform for gift card transactions using MongoDB. Infosys is identifying tools; defining and implementing proof of concept (PoC) for all its existing and prospect customers in Banking and Finance vertical Infosys is designing and implementing architecture in Big Data and SOA, for key customers in Banking and Finance vertical. Infosys BigDataEdge™ Key Projects from CA, USA Infosys is architecting Big Data solutions for Enterprises using algorithms like, o GLMNET o Random Forest o Linear Regression o Clustering etc. Infosys use Big Data Technologies like R, Rhipe, RMR, Hadoop - Hive and Map Reduce Infosys is successfully implementing Scalable and High Performance Data Platforms Infosys implements next generation web BI platform for telecom service providers in USA 20 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys delivers project from its office location address: 7707 Gateway Boulevard, Suite 110, Newark, CA 94560; Phone: +1 510 742 3000; Fax: +1 510 742 3090. Technology Landscape deployed in Infosys office located in CA, USA BigDataEdge™ platform: Ambari, Zookeeper, Sqoop, Hive (Stinger), MapReduce, R Software, Rhipe, GLMNET, Mahout Algorithms, Hortonworks Data Platform 1.3 Databases Programmin g Languages Operating Systems Tools PM Tools DB2 on LUW and z/OS COBOL Linux HMC MS Projects Infosys IPM+ Oracle 10g/11g JCL AIX Ambari RPM IBM’s QMS (Quality Management System) IMS SQL Solaris Nagios ILC OPAL IDMS REXX CentOS Talend SDMS CMMI Hadoop Assembler, z/OS, OS/390, UNIX Shell Windows Scripts and Java TOAD, OEM, IBM Data NIKU, Studio and DB2Control HPOV, Centre Tools, DB2TOP, IBM DB2PD, DB2exfmt, Manage DB2expln, CA-now Platinum for DB2, and BMC Apptune, STROBE, Clarity ERWIN, FILE-AID, QMF, Omegamon, TMON Processes Clarity 21 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys BigDataEdge™ and CRM Analytics projects served since 2012, from Infosys office located in Plano, Texas, USA Infosys delivers the following major projects using BigDataEdge™ 1. Social Media Analytics/CRM solution for a global pharmaceuticals company in Philadelphia, PA, USA 2. Big Data/Analytics solution for a leading Casual Dining chain in Orlando, FL, USA 3. CRM Analytics solution for an office supplies company in Miami, FL, USA 4. Social CRM/Analytics for a Financial Services/Insurance company in USA 5. CRM/Analytics solution for a leading Japanese Auto company in Dallas, Texas, USA 6. CRM/Analytics solution for a pharmaceuticals company in New York city, USA Infosys office Location of project delivery: Plano, Texas, USA. Infosys BigDataEdge™ use Microsoft’s internal Big Data Analysis platform called Cosmos in Log Retention Service The Log Retention Service provided by Infosys is addressing the process of collecting the log data from federated services and retaining it, so that it can be provided to the syndication partners of Infosys. This aids syndication partners in fulfilling local data retention requirements. While providing the Log Retention Service, Infosys executed deployment of WCF services on to Windows Azure platform and implemented uploading logs to Microsoft’s internal big data analysis platform called Cosmos. Infosys delivers BigDataEdge™ projects from onsite location address: 3326 160th Avenue SE, Suite 300, Bellevue, WA 98008; Phone: +1 425 256 6200; Fax: +1 425 256 6201. Project Start: February 2013 Windows Azure C# WCF PowerShell MEF Windows Forms MS Build Source Depot StyleCop FxCop CodeFlow Microsoft Cosmos Address of Infosys office that use Microsoft Cosmos in BigDataEdge™ product: 3326 160th Avenue SE, Suite, 300, Bellevue, WA 98008. Phone: +1 425 256 6200. Fax: +1 425 256 6201 22 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys BigDataEdge™ use Oracle Exadata in CA, USA Infosys BigDataEdge™ is coupled with in Cloud Computing - Big Data Practice (erstwhile Performance Engineering and Enhancement practice) Project Management activities are, o Oracle Exadata optimization o Oracle Database Performance optimization o SQL Tuning and PLSQL o Performance Consulting Services o Performance Assessment o Performance Benchmarking o Performance Tuning o End-to-End performance engineering life cycle o NFR gathering and Validation o Performance modeling o Hardware sizing and capacity planning o Performance testing strategy o Performance testing using load-runner o Data modeling Infosys delivers project from its office location address: 7707 Gateway Boulevard, Suite 110, Newark, CA 94560; Phone: +1 510 742 3000; Fax: +1 510 742 3090. Hadoop Pig and HP, IBM, Sun Windows servers and Linux, HPUX, AIX, Solaris, Windows, RHL Oracle Exadata, Oracle 11g 10g / 9i, / Oracle TimeTen IMDB Oracle 10g AS, Weblogic 8.1 Mercury Load Runner, Oracle 9i Developer Suite, Oracle Designer6i, Mercury Test Director, HP Glance Plus. PL/SQL, SQL, Statspack / AWR, Tkprof, Oracle Sar, Vmstat, Lostat, Forms, Pro*C, Unix Perfmon Shell Programming 23 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys BigDataEdge™ - Digital Marketing Strategy Digital Marketing – Products, Platforms and Communities Team in Infosys launched BigDataEdge™, Cloud Ecosystem Hub and Finacle Digital Commerce in the span of one year, which included o Website launch o Digital outreach and branding / lead generation through LinkedIn, Google PPC, etc. Infosys garnered over 7K visits to the BigDataEdge™ site and 100+ leads in two weeks after launching in February 2013. Infosys BigDataEdge™ - Internal training program on Big Data is called ‘Infosys at the Cutting Edge’ Infosys’s internal training program on Big Data called ‘Infosys at the Cutting Edge’ is the initiative taken by the CTO of the Retail Unit, during 2012. Top Performers across all business verticals are recognized in a program titled, ‘A Day in the Life of a Leader’ during 2012. Top performers in Big Data Practice were also recognized in this program during 2012. Top Performing employees working at the client sites are recognized in a program titled, “Client Superstars of the Month” during 2012. Top performers in Big Data Practice, who are located in Client locations, were also recognized in this program during 2012. 24 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys BigDataEdge™ - Customer Intelligence Partial list of customers using Infosys BigDataEdge™ SI # Customer Name Customer Satisfaction Index 1 Bank of America 7/7 2 SunTrust Banks Inc 7/7 3 Aetna 7/7 4 Humana 7/7 5 McKesson 7/7 6 Wells Fargo 7/7 7 Fidelity Investments 7/7 8 PMI Group 7/7 9 First USA (currently JPMC) 7/7 10 Deutsche Bank, USA 7/7 11 CapitalOne, USA 7/7 12 Ally Bank, USA 7/7 13 Shinsei Bank, Japan 7/7 14 Wachovia Bank, USA 7/7 15 Nokia, Asia Pacific 7/7 16 Cathay Pacific 7/7 17 DHL Worldwide 7/7 18 Apple Inc 7/7 19 Xerox 7/7 20 Motorola 7/7 21 T-Mobile 7/7 22 Cisco 7/7 23 Ahold, Netherlands 7/7 24 Pfizer 7/7 25 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. 25 GAP Inc 7/7 26 SuperValu 7/7 27 DIRECTV 7/7 28 Erricson 7/7 29 Pfizer Inc 7/7 30 Deutsche Bank 7/7 31 British Telecom 7/7 32 Sainsbury’s 7/7 33 GMAC Financial Services 7/7 34 Toyota 7/7 35 PricewaterhouseCoopers 7/7 36 Baker Hughes 7/7 37 CardinalHealth 7/7 38 American Express 7/7 39 Deutsche Bank 7/7 40 DBS Bank Ltd 7/7 41 Walmart Stores 7/7 42 UL 7/7 43 AMP, USA 7/7 44 Australian Bank 7/7 26 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. 1 - Infosys BigDataEdge™ serves Wells Fargo Bank, USA using MongoDB in ‘FOREX Position Risk Management Application Infosys has built its team; prepared project scheduling; performed design and development of the ‘Forex Position Risk Management application’ Infosys has built a strong technology team using Mongo DB, Eclipse RCP, Oracle Coherence CEP, Distributed Computing, Messaging Systems Java MongoDB Spring Oracle Coherence Scrum Maven GUI: Eclipse RCP Anthill SQL Server My Batis Nexus Quartz Infosys delivers project from its office location address: 13777 Ballantyne, Corporate Pl, Suite 250, Charlotte, NC 28277, USA; Phone:+1 704 972 0320; Fax:+1 704 972 0311 Project Start: August 2011 Project Delivery – 1st Phase: October 2012 Infosys served Wells Fargo Bank, USA by delivering ‘Enterprise Reporting Platform’ Infosys designed and developing an Enterprise Reporting Platform across the organization of Wells Fargo, USA Spring Jasper Reports Scrum Maven Nexus Anthill Infosys delivers project from its office location address: 13777 Ballantyne, Corporate Pl, Suite 250, Charlotte, NC 28277, USA; Phone:+1 704 972 0320; Fax:+1 704 972 0311 Project Start: April 2011 Project End: August 2011 Infosys BigDataEdge™ use ‘Oracle’ while serving Wells Fargo Bank, USA Wells Fargo runs Oracle Application Express 4.1 on Oracle Real Applications Clusters (Oracle RAC) 11g Release 2. It uses the Oracle Application Express interface to provide a portal for the Wells Fargo Database Neighborhood (DAN). Among the key portal components are, o Management of the scheduling, data migration, add-target requests, and communications for the Oracle Enterprise Manager 12c Cloud Control migration o Interface for creating production support e-mails to DAN customers 27 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. o Oracle patch set reporting used to identifying where Oracle Database PSUs/CPUs are missing o Reporting for host-level jobs running in the DAN o Reporting for the policing of resource utilization in DAN Wells Fargo Bank use Database Area Neighborhood (DAN) o DAN provides infrastructure services for Oracle Real Application Clusters o DAN is supporting: 5 Lines of Business 45 Oracle DBAs 1100 Databases 430 hosts 91 clusters Enterprise Data Management (EDM) DAN Infrastructure Services Operations Community Banking Corporate Wealth Consumer Lending APEX Infrastructure Components DAN MetaData Interface via Oracle Application Express Oracle APEX v4.1.1 Oracle Application Server 10gR2 with PL/SQL Toolkit (mod_plsql) Oracle RAC Database v11.2.0.2 Real Application Cluster (RAC) v11.2.0.2 Application Express Plan o Map out how your application will flow Design o Make use of key APEX components Deliver Solutions o Understand how you can best meet your customer’s needs 28 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Data Consolidation using DAN MetaData via Oracle APEX Best Practices Implementation of Oracle a. Utilize PLSQL procedures and packages for application logic b. Keep complex queries in views c. Maintain a consistent look and feel for interface i. Theme ii. Navigation iii. Formatting with CSS d. Make use of built in features such as Interactive Reports e. APEX Advisor f. Monitor Activity of the Application g. Slow Page, check underlying queries h. Debug Mode for problem analysis i. Code Review 29 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys BigDataEdge™ serves Wells Fargo Bank, USA using Oracle Active Data Guard for Data Protection About Wells Fargo - Enterprise Data Management (EDM): Centralized Technology Group Supporting Multiple Lines of Business Support 2000+ Oracle Databases Across 10 Data Centers in the U.S. Project status during 2012: Wells Fargo initiated a project for the brokerage to migrate away from using costly hardware-based replication and an “All or Nothing” data center failover model for Disaster Recovery (DR). Project Goal: Enable a more flexible component level failover Reduce overall costs Leverage the DR hardware to improve ROI Previous Solution Architecture in Wells Fargo Bank Project Requirements: Consolidate Multiple Databases Clusters Reduce Production Load by Offloading Backups & Ad/Hoc Queries Reduce DR Testing Requirements 30 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Support Component Level Failover Shared Services RO calls must run Locally due to Response Time SLA Ease of Maintenance Present Solution Architecture (as of December 2013) Key Points: During OLTP window, database averages 750 commits and 3MB of Redo per second. Standby side is able to maintain an Apply Lag <= 1 second. The ability to offload backups to standby, reducing physical reads on the primary cluster by 18%. User Reports and Ad/Hoc queries make up approx 5% of workload. They are now able to be isolated to standby cluster to limit impact to online activity. The ability to monitor Active DR Databases and reduce the frequency for DR testing. Success factors: Tuning Redo Transport to limit Apply Lag o Enabling Redo Transport Compression Helped lower bandwidth on average by 2.5x 31 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Standby DB are able to catch up much quicker from Redo spikes o DB Parameter “LOG_ARCHIVE_MAX_PROCESSES” Default value = 4 Increased value helped with heavy redo spikes during batch activity The ability to backup and recover a database from either cluster o All backup files are cataloged and available to both clusters o Makes for EASY re-instantiation and incremental recovery of very large databases Proper TNS Configuration o Leverages Role Based Named Services o Descriptions list the Local Cluster first o Tune Connection timeouts to limit login time when primary cluster is offline. o CONNECT_TIMEOUT, TRANSPORT_CONNECT_TIMEOUT & RETRY_COUNT PROD_Reports = (DESCRIPTION_LIST = (LOAD_BALANCE=off) (DESCRIPTION = (LOAD_BALANCE=on) (CONNECT_TIMEOUT=5) (TRANSPORT_CONNECT_TIMEOUT=3) (RETRY_COUNT=0) (ADDRESS_LIST = (ADDRESS = (PROTOCOL = tcp) (HOST = LOCAL-SCAN) (PORT = 1521))) (CONNECT_DATA = (SERVICE_NAME=PROD_REPORTS))) (DESCRIPTION = (LOAD_BALANCE=on)(RETRY_COUNT=2) (ADDRESS_LIST = (ADDRESS = (PROTOCOL = tcp) (HOST = REMOTE-SCAN) (PORT = 1521))) (CONNECT_DATA = (SERVICE_NAME=PROD_REPORTS)))) Configure services to auto-start based on database role to manage where clients can connect, no system event triggers needed: srvctl modify service -d db_unique_name -s PROD_RW -l PRIMARY srvctl modify service PHYSICAL_STANDBY -d srvctl modify service -d PRIMARY,PHYSICAL_STANDBY db_unique_name -s db_unique_name PROD_REPORTS -l -s -l PROD_RO 32 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. The ability to monitor and report on standby apply lag: o Grid Control to alert when apply lag exceeds SLA o Dictionary views to monitor and troubleshoot issues V$DATAGUARD_STATS – Shows current status V$STANDBY_EVENT_HISTOGRAM – Histogram of Apply Lag V$ARCHIVE_GAP – Any archive gap blocking recovery Infosys BigDataEdge™ use Oracle Database 12c (today and in Future) for Wells Fargo Bank Writeable Global Temp Tables on the Physical Standby to Enable Additional Offloading of Reports Better Audit Collection and Management for Physical Standby Activity Simplified Rolling Upgrades New features to be included in Wells Fargo using Active Data Guard Protection and Increased Return On Investment o Zero-Data-loss protection at any distance o Writing to global temporary tables o Using unique sequences o Real-time Cascading Monitoring and Manageability o New capabilities of the Data Guard Broker Planned Maintenance o Automated Rolling Database Maintenance EDA (Enterprise Data Analytics) at Wells Fargo Bank Infosys BigDataEdge™ is creating PoC for using Big Data technologies in Wells Fargo’s EDA (Enterprise Data Analytics) project. Infosys BigDataEdge™ is developing MapReduce programs for extracting and analyzing data in Aster hadoop platform. Infosys BigDataEdge™ is designing and implementing methodologies new ETL process and 33 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys BigDataEdge™ is designing and developing application layer for six different projects on top of the existing EDW Infosys BigDataEdge™ has designed and developed ETL jobs using Abinitio and Terdata BTeq scripts Infosys BigDataEdge™ serves Wells Fargo’s business users on ADHOC data pulling and loading data. Technology Landscape: AbInitio (GDE (3.1.3) and Co. Op Oracle 10G 3.1) Teradata UNIX Script Shell Pre populate risk model (PPRM) project during 2010: The purpose of Pre populate risk model (PPRM) project is data integration of WFE (Wachovia) and Wells Fargo for Fraud detection and prevention. AbInitio (GDE (3.1.3) and Co. Op 3.1) Oracle 10G UNIX Shell Script - MSR (My spending report): MSR (My spending report) is an online banking solution which helps to analyze each customer’s spending patterns. AbInitio (GDE (3.1.3) and Co. Op 3.1) Oracle 10G Teradata UNIX Shell Script Key number about Wells Fargo Bank (as of June 2013) Assets: $1.4 trillion Team members: More than 275,000 Customers: 70 million Wells Fargo Bank - Digital Banking Strategy (as of December 2013) Digital Banking Strategy of Wells Fargo is to re-assess, re-imagine, and re-launch the online banking offering for 70 millions of consumers, taking into account the rapid adoption of social media, mobiles, and new devices such as tablets. 34 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Digital Banking Strategy Team of Wells Fargo is analyzing and evangelizing Social Media trends, adoption by customer segments, impact on customer behavioral patterns, emerging best practices both within and outside banking, and identified related business opportunities. Wells Faro Bank is prototyping customer digital experience and instilling social plug-ins. Digital Banking Strategy of Wells Fargo is building the multi-million business case to transform the digital channels and upgrade the digital user experience for 20MM+ digital customers. Digital Banking Strategy of Wells Fargo has valuated analytical capabilities to better target and deepens the relationship with customers by leveraging marketing analytics and enabling better personalization and contextualization of product offering. 2 - Infosys Technologies Ltd serves Bank of America (BOA) using BigDataEdge™ Infosys is serving Bank of America (BOA) during the past 13 months (since November 2012, even before BigDataEdge™ is marketed as a brand, in the media during February 20, 2013) Infosys serves Bank of America (BOA) to re-platform Teradata applications to Vertica / Hadoop based architecture and consolidates multiple warehouses in the BOA’s domain. During December 2013, Infosys BigDataEdge™ team is extensively working to perform End-to-End development for Bank of America(BOA), o Stage 1: Initial proposal preparation and submission o Stage 2: Business Case Development o Stage 3: Architecture definition o Stage 4: Implementation of roadmap definition using Big data technologies (BigDataEdge™) o Stage 5: BigDataEdge™ Implementation in multiple location of BOA. Project Onsite Location: USA Project Offshore Location: -Not Available- Project Start: November 2013 35 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Project End: -Not known- (From January 2014, Infosys BigDataEdge™ will perform implementation and delivery, in fast progress). Infosys is delivering BigDataEdge™ to BOA from Atlanta, GA, USA: Address - 3200 Windy Hill Road SE, Suite - 100W, Atlanta, GA 30339, USA; Phone:+1 770 799 1860; Fax:+1 770 799 1861 Infosys BigDataEdge™ serves Bank of America (BOA) using “Customer Data Mart (CDM)” The Customer Data Mart (CDM) project’s objective is to have a one-stop shop for all customer related information that would provide a 360 degree view of customer related data. This was done post the 2008 downturn to avoid a scenario where the bank had encountered losses due to the lack of comprehensive customer information. The CDM application sources data from various source systems like UNIX and Mainframes that send all related data through files or even directly pull from source database tables. Data is fetched across different lines of business, processed by subjecting it to transformations, organized it in a customer oriented format and sent to the downstream systems. This data is used in risk analysis for a customer, and used to make key decisions in adjudging customer eligibility for loans, mortgages, card applications etc and credit reporting/analysis. Customer Data Mart (CDM) project involves migration of one of their unit from Teradata to HP Vertica as Vertica offers increased security features and benefits. Infosys performed consolidation of CDM and C&RA (Customer and Relationship Application) on a single platform CDH (Customer Data Hub). Hadoop Teradata HP Vertica BTEQ Scripts Autosys Linux Infosys’ large engagements for BOA: Maintain Customer Information for Consumer and Small Business Banking customers of Bank of America The Customer Information Systems (CIS) is a shared services group within Consumer and Small Business Banking segment of Bank of America, servicing many channels like Banking Centers, Online Banking, ATM, VRS, etc. CIS was using KTC (Know The Customer) which is a Siebel CRM based system to manage customer information. Over the years, the KTC system became so huge and complex; it became challenging to manage. 36 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. The Customer Information Systems (CIS) management took a strategic decision to replace KTC with a Copernicus model having WCC (IBM Websphere Customer Center) as a hub containing core customer information and many Federated Applications containing different functions around WCC. Scope of Work: o (1) implementing and customizing WCC o (2) Building new Federated Applications CNE (for Alerts) OFR (for offers) FRE (for risks) SPX (for Combined Statements and Relationship Pricing) IDV (for identity validation) o (3) Integration with other systems due to mergers and acquisitions o (4) Implementing new business requirements. Onsite Location: BOA, Charlotte, USA Offshore Location: Bangalore, Chennai, Hyderabad, India Project Start: February 2009 Project End: December 2013 Number of Resources deployed since the Project started: 160+ J2EE Web Services COBOL DB2 JCL Oracle IBM Websphere Application RAD Server Rational Clear Case Quality Center FieldGlass Microsoft Visio Microsoft Project 3 - Infosys BigDataEdge™ serves American Express (AMEX) Infosys BigDataEdge™ is providing more accurate Business Intelligence (BI), performance efficiency and technical capabilities of the Hadoop platform. SAS Revolution R Protegrity Dataguise Open Source R Datameer KarmaSphere Microstrategy Informatica Sync Sort Infosys provides ‘Performance Architecture’ for American Express The customer (AMEX), an engine of commerce, provides innovative payment, travel and expense management solutions for individuals and businesses of all sizes. 37 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. The customer engaged Infosys to redesign their application so that it can scale up to handle the increased load in the system due to open up the system to web users. Infosys executes database administration activities for both DB2 on z/OS and UDB on LUW Infosys provides performance improvement recommendations by analyzing existing system Infosys performs workload management and analysis IBM mainframes – z/OS COBOL JCL DB2 PL/SQL Stored Procedures DB2 UDB Unix MS Office suite BMC / Apptune BigDataEdge™ - Customer Satisfaction Index: 7/7 Project Start: June 2009 Project End: March 2010 4 - Infosys BigDataEdge™ serves SunTrust Banks Inc Infosys BigDataEdge™ is serving SunTrust Banks Inc during the past 17 months (since August 2012) even before BigDataEdge™ is marketed as a brand, in the media during February 20, 2013) Infosys serves SunTrust Banks Inc to implement a validation and an application of Big Data technologies (BigDataEdge™) for Risk Management. During December 2013, Infosys BigDataEdge™ team is extensively working to deliver this project during June/July 2014, o Stage 1: Initial proposal preparation and submission o Stage 2: Defined Proof of Concept o Stage 3: Defined Architecture o Stage 4: Defined roadmap for BigDataEdge™ implementation at the SunTrust Banks Inc. o Stage 5: BigDataEdge™ Implementation in multiple location of SunTrust Banks Inc. Project Onsite Location: USA Project Offshore Location: -Not Available- Project Start: August 2012 38 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Project End: 2014 (From January 2014, Infosys BigDataEdge™ will perform implementation and delivery, in fast progress). Infosys is delivering BigDataEdge™ to SunTrust Banks Inc from Atlanta, GA, USA: Address - 3200 Windy Hill Road SE, Suite - 100W, Atlanta, GA 30339, USA; Phone:+1 770 799 1860; Fax:+1 770 799 1861 5 - Infosys Technologies Ltd serves Apple Inc using BigDataEdge™ Infosys is serving Apple Inc during the past 17 months (since August 2012) even before BigDataEdge™ is marketed as a brand, in the media during February 20, 2013) Infosys serves Apple Inc to build a reconciliation platform using NoSQL technologies (MongoDb) and techniques - to improve performance for processing gift card transactions in US Market. During December 2013, Infosys BigDataEdge™ team is extensively working to deliver this project during June/July 2014, o Stage 1: Initial proposal preparation and submission o Stage 2: Defined Proof of Concept o Stage 3: Gift card reconciliation track for Apple Inc o Stage 4: Defined Architecture o Stage 5: Defined roadmap for BigDataEdge™ implementation at the Apple Inc. o Stage 6: BigDataEdge™ Implementation in multiple location of Apple Inc. Project Onsite Location: USA Project Offshore Location: -Not Available- Project Start: November 2011 Project End: Not Known (From January 2014, Infosys BigDataEdge™ will perform implementation and delivery, in fast progress) Infosys is delivering BigDataEdge™ to Apple from Atlanta, GA, USA: Address - 3200 Windy Hill Road SE, Suite - 100W, Atlanta, GA 30339, USA; Phone:+1 770 799 1860; Fax:+1 770 799 1861 Infosys BigDataEdge™ serves Apple Inc using Reconciler platform Reconciler for Apple Inc, a generic financial reconciliation product for reconciling many transactions (inbound and outbound), which happen through many sales channels 39 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. (Retail and Online). This is a highly scalable and configurable platform that supports multi tenants of different products. Reconciliation platform provides one standard and scalable platform to accomplish reconciliation needs for reconciliation of various products - Multi tenancy support. iPhone Reconciliation, Gift Card Reconciliation and iTunes Reconciliation, are tenant applications on boarded specific to iPhone, Gift Card and iTunes product reconciliation that provide user interface to configure the templates definitions, to configure the reconciliation rules, analyze the results of reconciliation performed by the platform and enable end user actions for identified use cases. Infosys BigDataEdge™ is implementing multiple Hadoop for processing the billion transactions in Apple Inc. Infosys BigDataEdge™ is scheduling Oozie workflow engine to run multiple jobs and performs capacity planning, cluster set-up with Replicas, and Back up. Infosys BigDataEdge™ executes Performance Tuning (process tuning and query tuning), Stress testing on each of this application to handle the volume of a billion transactions per month. Infosys provides Enterprise wide Performance Engineering and Optimization solution to Apple Inc. The solution focuses on delivering high performance and scalability for mission and business critical applications. Hadoop Hive Grid Gain Oracle Coherence MongoDB Java 5 IBM WebSphere eXtreme Scale Design Patterns: Web Development: J2EE Distributed GoF and J2EE Web Components Technologies: EJB 2.1 Patterns (Servlets / JSP), Struts JMS, IBM MQ Series 1.1/1.2, Web Services, JavaScript, XML, XSLT, HTML, CSS Databases: Oracle 10g, SQL Server 2000, MySQL Sever Tools / Frame Work: Ant, Development Maven, XDoclet, Quartz, Methodologies: Sonar, Jenkins, Crucible, SCRUM, RUP Struts 2, SPEL, Spring MVC, Spring Batch, JQGrid, JQuery, Hibernate 3.0, Application and Server: Tomcat BEA Web Logic 8.1, JBoss WebSphere Web 4.1+, 7.0 / 3.2.7, Browser Technologies Agile HTML/HTML5, JavaScript, CSS/CSS3, AJAX, JSON, JQuery 40 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Log4J, JUnit, TIBCO Portal Builder, Spring Cache Currently, Infosys performs TeraData to Hadoop Migration Semantics Aggregation, for Apple Inc Hive Oozie TeraData Hadoop Platform: HortonWorks 1.0.3 Project Start: February 2013 Project End: During 2014 Infosys delivers Apple, Inc from Infosys office located in CA, USA: 7707 Gateway Boulevard, Suite 110, Newark, CA 94560; Phone:+1 510 742 3000; Fax:+1 510 742 3090 Infosys India delivers Hadoop EDW for Apple Inc, since 2010 Hadoop-EDW Semantic Integration A Semantic application is developed for carrying out all the Aggregate processes. The Hive queries are required to be formed and configured at design time scripts for the Oozie workflow. The Autosys job calls the python script which executes the Oozie workflow and in-turn the hive queries automatically. The metadata comes from the semantic framework which connects to Oracle using myBatis. The metadata is updated automatically using framework. o Infosys developed oozie workflows for end-to-end execution and framework interaction. o Infosys performed Hive Queries to accomplish the task through joins and functions. o Infosys developed and tested Autosys jobs to schedule the execution of aggregate tasks after the dependencies are fulfilled. o Infosys performed Unit testing, Data Validation and Production Deployment. Hadoop-EDW Semantic Framework Development A Generic Semantic framework is developed for carrying out all the Semantic processes. The framework executed the queries automatically after being started by the Autosys job. The framework had the capability for statistics collection and used to update the run-time metadata. Infosys has developed Java classes for end-to-end framework interaction and developed the utility functions. 41 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Java Hadoop Hive Oozie iBatis Teradata Eclipse Mac OS X Linux Hadoop EDW Pilot phase for Apple Inc Hadoop EDW Pilot phase is a cloud migration project developed using java that uses Hadoop as File system and Hive. Data of various applications is loaded into the EDW using the ETL process. PoC was created by Infosys to migrate the EDW data from Teradata to Hadoop. Data from Teradata tables is migrated to Hive tables and the Hive QL is used to run map-reduce jobs on HDFS to load the data from flat files to the core tables. Data generated by various applications across all regions is finally aggregated as an end result. In addition to faster processing and reduced cost storage, it also migrate existing data from Teradata to Hadoop. Infosys developed Java classes for end-to-end framework for interacting with Hadoop Infosys used Hive Queries to accomplish inserts and update on data through joins and functions. Hadoop Hive Java Teradata Eclipse Mac OS X Linux - Project Start: June 2011 Project End: April 12 Infosys BigDataEdge™ developed and delivered Replication- EDW2.0 for Apple Inc Replication EDW 2.0 is a framework to replicate data to and from databases within and across multiple products spanning across various geographical regions. It is built in 3 layers - Eventing layer for producing and consuming messages to and from Application Teams, Replication Engine for replicating data from one database to another and Orchestration Layer for wiring eventing layer with replication. It consumes messages from application teams as events, uses built in intelligence to direct this event to an appropriate replication flow, execute replication flow and notify application team of the end result. Also, it is capable of replicating data to and from products like Teradata, Oracle and Hadoop. Infosys created PoCs for Talend Studio and Talend Big Data - ETL tools that provide adapters for interacting with varied products. Infosys use Perl to write a script for accepting some data from user validate it and pass it to stored procedure running on a server. 42 Sales Intelligence™ Report Infosys BigDataEdge™ Java, J2EE Teradata Utilities “Everything about Customers”. Perl Mac OS X Linux Infosys BigDataEdge™ developed and delivered Hadoop EDW 2.0 for Apple Inc Hadoop EDW 2.0 is a cloud migration project developed using java that uses Hadoop as File system and Hive as the underlying Database. Hadoop EDW 2.0 is provides cloud-based storage to data generated by various Applications across all regions and generates aggregates as an end result. In addition to faster processing and reduced cost storage, it also migrate existing data from Teradata to Hadoop. Project Start: June 2011 Project End: November 2011 6 - Infosys BigDataEdge™ product is used in T-Mobile’s Big Data - Web Analytics Infosys BigDataEdge™ is engaged with the T-Mobile Enterprise Architecture and Transformation Solution team to build a next generation web analytics platform that can handle massive amount of data coming from multiple channels and properly correlate those data to build insights and next generation Business Intelligence Platform. The new platform required to be capable of handling data coming through click stream and correlate those data with subscriber and corporate data stored in enterprise DW. Infosys BigDataEdge™ performed Research to understand the current Landscape and Data Sources for T-Mobile Web BI Platform and also documented the current DWBI Architecture. Infosys BigDataEdge™ performed detailed analysis of the Data Sources and current technologies used to analyze those data, for T-Mobile Infosys BigDataEdge™ created integration architecture with a focus on using new technologies to augment the current analytics, for T-Mobile Infosys BigDataEdge™ Identified and documented use cases that should be implemented in the new Platform, for T-Mobile Infosys BigDataEdge™ performed installation of Hortonworks Data Platform, for TMobile. Infosys BigDataEdge™ designs Hive tables with partitioning, bucketing and indexing and also created Hive queries to support the business objectives. 43 Sales Intelligence™ Report Infosys BigDataEdge™ Infosys BigDataEdge™ is monitoring and managing Hadoop cluster using Ambari and Nagios Linux Hive 0.11 HDP 1.3 Ambari Zookeeper HBase “Everything about Customers”. SQL Server Eclipse jqGrid jQuery BigDataEdge™ - Customer Satisfaction Index: 7/7 Project Start: March 2013 Project End: Expected to be completed during February 2014 Project delivery location from Infosys office located in Bellevue, WA, USA 7 - Infosys BigDataEdge™ product is used in GAP Inc for ‘Mainframe Re-hosting Program’ Infosys BigDataEdge™ is engaged with the GAP Inc to discontinue use of Mainframe and moving the application to Clerity platform in RHEL. Infosys BigDataEdge™ and Dell prepares requirement gathering after reviewing Blue Printing documents for new platform, a. Target Infrastructure b. Database Architecture c. Security Architecture d. Printing and Reporting e. Interfaces f. Tape data Migration Strategy g. Backup, Archival and DR solution Infosys BigDataEdge™ is creating database Migration Strategy for re-hosting the mainframe DB2 Databases to target database DB2 on Linux platforms Infosys BigDataEdge™ has installed UDB software on Linux environments and manages DB2 on z/OS and DB2 on Linux Infosys BigDataEdge™ automates the migration tasks to eliminate manual errors in both Mainframe and Linux IBM COBOL JCL DB2 RHEL z/OS CICS CA Platinum RACF UniKix TPE/BPE DB2 10.1 LUW BigDataEdge™ - Customer Satisfaction Index: 7/7 44 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Project Start: June 2012 Project End: March 2013 Project delivery location from Infosys office located in San Francisco, CA, USA 8 - Infosys BigDataEdge™ - Strategic Insights Practice of Campaign Analytics for Ally Financial, Ally Bank Infosys BigDataEdge™ is developing analytical solutions to enable segmentation and targeting, campaign design and measurement, for Ally Financial, unit of Ally Bank. Infosys BigDataEdge™ is designing solutions to implement new direct marketing strategies, for Ally Financial. Infosys streamlines and automates the segmentation process to achieve high levels of quality and improve operational efficiency, for Ally Financial. Infosys has streamlined the Cross-Sell marketing process which helped in cutting down the manual effort from1.5 FTE to 0.75 FTE, 80% reduction in space usage and 78% reduction in total cycle time. Technology Landscape used by Infosys BigDataEdge™ for Ally Financial: Analytical Tools: SAS (Base, Big Data: Hadoop RDBMS: Office Tools: Operating Macros, STAT & Access), and MADlib Oracle Microosft Systems: SAS Enterprise Miner, SAS EG, R, Office Suite Windows MATLAB, UNICA NT/XP Project Onsite Location: Plano, TX, USA Project Offshore Location: -Not AvailableProject Start: September 2011 Project End: Project implementation is in Progress. Delivery date is not known, due to upcoming deployment challenges. Infosys is delivering BigDataEdge™ to Ally Financial from Atlanta, GA, USA: Address 6100, Tennyson Parkway, Suite 200, Plano, TX 75024, USA ; Phone +1 469 229 9400; Fax +1 469 518 3858 45 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. 9 - Infosys serves Sears Holdings Corporation using IBM Infosphere DataStage 8.0.1 and preparing PoC using Infosys BigDataEdge™ Infosys is designing a common promotion framework for both Sears and Kmart for sending the promotions (coupons / gift cards / mail in rebates) to the store and the corresponding signs information to the store. This involves sending the best price for a product in a store among the promotions set by multiple merchants at corporate office. The price selected in a set of business rules are sent to the store and sent to a third party for printing the circulars/coupons. Also, Infosys Technologies is validating the business rules against the store and product like store authorization, best price winning for a product, sign information, sending the promotion prices of a product to the store register, handling different promotional deals for a product etc. The main Functionality of the project is to send all the deals and prices in Kmart and Sears Stores and the business team that creates the offers online, and then Infosys have to send those offers to mainframe at the offer product store level. Once the offers are created then online team run the java batches to create the market and product files and then we start processing the data. Infosys executed several steps to process the data. i. First, Infosys explore the markets and products and then perform store authorization and chaining and collision. Infosys process around 1.5 billion records in these steps and then load the staging tables. ii. From the staging tables, Infosys send the data to all the downstream systems. For the mainframe system which sends the price to stores, Infosys provide the fixed width files at event level, event product level and event product store level. There are other downstream systems like coupon, rewards, rebates etc. iii. For signing, Infosys sends the data to a system which is called RES and it also provide data to allocation system. iv. Then, Infosys load the datastore table which is used for Microstrategy reporting where Infosys keep the history records. v. The above process runs every day as part of Infosys regular run and beside that Infosys run emergency batch twice daily. Infosys used DataStage Designer to develop various Parallel jobs and Server Jobs to extract, cleanse, transform, integrate and load data into different tables. Infosys has extracted data from different source systems such as DB2 UDB, Flat files, VSAM files, Mainframe DB2 etc. Infosys executes performance tuning on Datastage Parallel jobs due to huge volume. 46 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys uses Teradata Fast Load, Teradata Fast Export and different utilities of Teradata. IBM Infosphere DataStage 8.0.1 IBM DB2 UDB IBM Mainframe Teradata DB2 Shell Scripting WinCVS Versioning Control M Tool Scheduling Tool 10 – Infosys BigDataEdge™ serves Deutsche Bank using ‘Data Analysis Application’ Infosys performs design and development of analyzing financial data using Hive Infosys designing Hive UDF’s for XML data archived in HDFS Infosys is creating a PoC using Composite Data Virtualization tool, Hive and Oozie. Infosys is developing Maven plugins for Hadoop based maven repository Hadoop(HDFS) Hive Oozie Composite Build Management Tools: Maven Spring Java Oracle Development Methodology: Scrum Hadoop CDH 3/4 Environment: Location: 13777 Ballantyne, Corporate Pl, Suite 250, Charlotte, NC 28277; Phone:+1 704 972 0320; Fax:+1 704 972 0311 Project Start: Oct 2012 Project End: June 2013 11 - Infosys performs ‘Investor Portfolio Analysis’ and ‘NAV Prediction of omnibus funds’ for DBS Bank Ltd Infosys performs analysis of investor portfolio based on transaction summary provided by custom views of underlying oracle tables for other asset classes of WMS like bonds, equities etc. NAV prediction using various techniques like social media analytics and mining the historical transaction data. Hadoop Pig Hive Sqoop Shell Scripting 47 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. 12 - Infosys serves its customers using Weblog Analytics Infosys performs Big Data analytics include Extraction, Transformation and loading Data into Big Data Systems like Hadoop, Hive and Hbase. o Data comes from various sources like Application logs, Application Messages, Application Databases in different format. o The objective of the project is to extract, transform and load data to Hadoop based systems and do analytics on it like CDR analytics Bill Payment Fraud Detection Customer insights etc o The second track included digital media analytics and web traffic/Click Stream analytics based on digital media sources like, Awareness Direct Response E-mail Natural Search Paid Search Organic/Non organic traffic Platform (Web/Mobile) analytics o It also consists of purchase/order analytics based on Cart/Purchase log analytics to provide customer insights. Hadoop Hive MapReduce Oozie Shell Script Java Hbase - 13 - Infosys BigDataEdge™ - Strategic Insights Practice of Web Analytics for a Legal Services Customer Infosys BigDataEdge™ is serving a customer in Legal Services in understanding the user activity on their newly launched Legal Research portal and user dynamics which drives the product adoption and conversion (Paid Subscribers) using the Predictive Modeling. Infosys BigDataEdge™ team has analyzed all the drivers who play a vital role in adoption of the product and then based on the adoption score highlighted the key drivers 48 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys BigDataEdge™ team has assessed business needs of key business stakeholder and developed solutions to cater their needs Infosys BigDataEdge™ team is serving this customer to identify key customer engagement channels that are relatively more effective. Technology Landscape used by Infosys BigDataEdge™ for a customer providing Legal Services: Analytical Tools: SAS ( Base, Big Data: RDBMS: Office Tools: Operating Macros, STAT & Access), Hadoop Oracle Microsoft Systems: SAS Enterprise Miner, SAS and Office Suite Windows EG, R, MATLAB, UNICA MADlib NT/XP Project Onsite Location: 6100, Tennyson Parkway, Suite 200, Plano, TX 75024; Phone:+1 469 229 9400; Fax:+1 469 518 3858 Project Offshore Location: -nilProject Start: July 2010 Project End: 1st phase of the Project got completed during August 2011. 2nd phase of the project implementation is in fast progress. 14 - Infosys BigData™ serves its customers from its Infosys Office in CA, USA in “Foreign Exchange Risk Management Banking Domain Infosys is implementing and delivering Hadoop based applications like, o Cloudera CDH4 o Hive o Composite Data Virtualization tool Cloudera Hadoop HBase Map Reduce HDFS Hive Pig HBase Cascading Oozie MongoDB J2EE Technologies Oracle Coherence MongoDB Lucene/SOLR Amazon AWS Groovy on Grails Struts Spring Hibernate jQuery UML Rational Rose Rule Engine Web Services e-Learning LCMS Solutions Eclipse RCP - Project Start: June 2013 Project End: During 2014 49 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Project Delivery from Infosys Office Location: 7707 Gateway Boulevard, Suite 110, Newark, CA 94560; Phone:+1 510 742 3000; Fax:+1 510 742 3090 Infosys - Technology Infrastructure in CA, USA Languages: J2EE Application ORM: Java, Groovy, Technologies: Servers: JBOSS, Hibernate, SQL, PHP RMI, EJB, JSP, TOMCAT, Apache MyBatis, Web Services CodeIgniter (Axis 2) (PHP) Frameworks: Spring, Struts, Grails, SiteMesh, Ruby on Rails Enterprise Integration: Batch, Spring Integration Search / Rule Engines: Lucene, SOLR, e-Rules, Drools CMS: Drupal e-Learning / Distributed LCMS Tools: Computing: OLAT Hadoop (Cascading, Map Reduce, HDFS, HBase, OOzie, Hive) Databases: Oracle, SQL Server, MongoDB Reporting UML: Tools: Jasper Rose Reports, JasperServer Caching Solutions: Oracle Coherence, JBOSS Cache Rational GUI: jQuery 15 - Infosys BigDataEdge™ serves its customers in ‘Mortgage and Online Credit Applications’ Infosys BigDataEdge™ serves its customers by delivering ‘Mortgage and Online Credit applications using its Big Data technologies. Infosys BigDataEdge™ has delivered Big Data projects involving, o Mortgages, Derivatives, Credit Card, Cable and Automotive applications o Enhancements to Banking Portals for Mortgage and Refinance applications. Infosys BigDataEdge™ is mapping the Business Processes associated with mortgage and refinance lending products from origination through to loan servicing to ensure that processes are in compliance, legal constraints and meet with new Financial Reform legislation and business rules. 50 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys BigDataEdge™ deploys JAD sessions with sales teams to capture process flow diagrams, business and functional requirements and translate the requirements into functional IT functional specs documents. Project delivery from Infosys Office Location: 13777 Ballantyne, Corporate Pl, Suite 250, Charlotte, NC 28277; Phone:+1 704 972 0320; Fax:+1 704 972 0311 Infosys BigdataEdge™ – Global Services IT Outsourcing Intelligence Infosys BigDataEdge™ serves Fidelity Investments Company in project ‘Sentiment Analysis’ Infosys is creating a PoC on BigDataEdge for Fidelity Investments Company, with the aim to perform ‘Sentiment Analysis’ to judge whether the customers are satisfied or dissatisfied with the services provided by Fidelity. Hive Pig Hadoop Backbone.js Ruby Rails JavaScript on Core Java Operating Systems: Windows, Mongo DB Linux Web Technologies: Web/App Server: Apache Jquery HTML, CSS Tomcat Web Server, Nginx Project Onsite Location: 82 Devonshire Street Boston, MA 02109, USA Project Offshore Location: Bangalore, India Project Start: July 2012 Project End: Infosys BigDataEdge™ product implementation is in fast progress Fidelity Investments, USA: Fidelity Investments’ IT Team has developed multiple applications with Netezza. Fidelity Investments’ have created a Java wrapper to pass SQL statements to Netezza. Data Movement and ETL: o Fidelity Investments’ IT Team is developing scripts to load large quantities of data into Netezza along with scripts to extract and load aggregated results into ODS databases. 51 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. o Fidelity Investments’ IT Team is developing workflows with Informatica to migrate data between databases. Fidelity Investments’ IT Team has implemented its customized Fidelity’s ETL Framework which governed how to manage data from the feed file up to and including moving aggregated data to the final table. o Logical Data Modeling – Fidelity Investments’ IT Team is creating objects to feed data from staging tables into final tables via stored procedures, triggers and queues utilizing. About Fidelity Investment’s Database Engineering Practice: Fidelity Investment has bootstrapped an engineering discipline within the Fidelity Institutional business unit to bridge the technical gap between architectural requirements and operational implementation and support. Fidelity Investment implemented a 50TB data warehouse using Oracle Exadata and IBM Netezza hardware. About Fidelity’s Personal and Workplace Investing (PWI) Division – IT Strategy Fidelity’s Personal and Workplace Investing (PWI) Division is currently managing application and IT infrastructure – optimizing performance and capacity for the Fidelity’s Health and Welfare portals and Fidelity.com applications Fidelity’s Personal and Workplace Investing (PWI) Division’s infrastructure program is delivering Fidelity’s Health & Welfare annual enrollment program (1million + online participants) Fidelity’s Personal and Workplace Investing (PWI) Division initiating for Data Center consolidation, Virtual Data Center build outs, Infrastructure restacking (moving Solaris to Linux, Oracle R12 & 11G upgrades and IBM P770 platform migration and P2V migration) Fidelity’s Personal and Workplace Investing (PWI) Division is developing and delivering next generation contact center strategy and PoC, a cloud based SAS model Integrated with existing Fidelity’s telephony, desktop and CRM infrastructure supporting 20 million+ Fidelity’s customers. Java, Oracle PL/SQL, C, DATABASES: Oracle 9i, PLATFORMS: UNIX C++, Unix Scripting, SQL, 10g, 11g, 12g, Exadata; [Solaris, AIX], Linux, DB2 HTML, XML Sybase 12 & 15; Sybase IQ; z/OS, Microsoft Windows SQL*Server; DB2 v8 & v9 on z/OS; Netezza Greenplum PoC BYO appliance NoSQL Hadoop MYSQL Qlikview 52 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. About Fidelity Investment’s Data Architecture Practice: Fidelity Investment’s Data Architecture team is driving the data strategy for Fidelity’s Personal & Workplace Investing (PWI) division. This includes Data Governance, Metadata, Data Movement (ETL), Data Modeling, Big Data, Data Architecture Frameworks (Staging to EDW to Data Marts and OLAP), Data Warehousing, Business Intelligence, Master Data Management and Data Quality for all major PWI Technology Portfolio initiatives including SOA data strategy. Fidelity Investment’s Data Architecture team drives and develops the Data Strategy roadmap to align with their business requirements. Fidelity Investment’s three Data architecture team are: o General Data Architecture o Data Movement (ETL) and Data Modeling o Data Governance, Master Data Management and Metadata Fidelity Investment’s Data Movement in Enterprise Platforms - Data Architecture Practice: Hadoop: Reducing Business Latency in Fidelity Investment 53 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Fidelity Investment uses Hadoop Based ETL Platform - Data Architecture Practice: 54 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. ETL Reference Architecture used in Fidelity Investment - Data Architecture Practice: Operational Data Refinery - Data Architecture Practice: Collect data and apply a known algorithm to it in trusted operational process Capture: Capture all data Process: Parse, cleanse, apply structure & transform Exchange: Push to existing data warehouse for use with existing analytic tools 55 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Big Data Exploration & Visualization - Data Architecture Practice: Collect data and perform iterative investigation for value Exchange: Explore and visualize with analytics tools supporting Hadoop 56 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Application Enrichment - Data Architecture Practice: Collect data, analyze and present salient results for online apps Exchange: Incorporate data directly into applications Data Ingestion: Fidelity Investments Ingestion into Hadoop can be categorized into the following types of data sources: RDBMS o Direct SQL Raw Files o RDBMS export files (delimited format) o Large unstructured data sets o Media files, other binary files Event-driven o Syslog o CDR o MOM (MQ, Tibco, JMS) o DB Trigger events 57 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Tools for Data Ingestion – considered in Fidelity Investments Data Source Type Extract & Load Description RDBMS (SQL) Sources Sqoop, Talend Connectors Bidirectional SQL to Hadoop connectors for common data sources Event-driven Flume Streaming event data into HDFS Queues/Messages WebHDFS, Flume RESTful API to Write/Read data into HDFS. Flume has JMS Source to extract from Raw Files WebHDFS, Java Write batched files to HDFS DFS Client, RDBMS exports Infosys BigDataEdge™ serves Walmart Stores using largest ‘Online Transformation Program’ Project deals with developing historical database using Hadoop ecosystem. Walmart maintains last 10 years of data spread across 4500 pharmacy stores across USA and Canada. Aim of the project is to centralize the source of data for audit / legal report generation using historical database which otherwise are generated from multiple sources. Infosys BigDataEdge™ operation for the Bank: Infosys BigDataEdge™ team is designing Big Data applications using Hadoop and its family of products. Hadoop - Hive Flume Source Systems: Legacy HDFS systems (RDBMS and XML web logs) MapReduce Sqoop Platform: Ubuntu Target Systems: HDFS Project Onsite Location: USA Project Offshore Location: Hyderabad Project Start: February 2013 Project End: Duirng 2014 - Infosys BigDataEdge™ implementation is in fast progress 58 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys BigDataEdge™ - Cloud Computing COE Cloud Computing COE is Infosys Labs’s research wing of Infosys Ltd that uses the following, o Open source Bigdata o NoSQL databases and search technologies o Virtualization Platforms Infosys performs implemented of Bigdata Content Repository solution over Hadoop and Hbase and provides full text search using Apache Solr. Infosys is developing private cloud solution known as “Infosys Cloud Dynamic Resource Management (iCDReM)” during past six months. Infosys has developed network provision and management modules for iCDReM in VMware and Citrix XenServer virtualization environments, using VMware VI and XenServer Java APIs. Infosys implemented iCDReM service instance VMs backup-restore modules for VMware environment. Infosys has implemented geo-coordinate based search solution for its client using Apache Solr and MongoDB. Infosys BigDataEdge™ serves UL by performing Test Report Analysis Infosys BigDataEdge received the quality test reports conducted by UL (http://www.ul.com). Infosys BigDataEdge team is required to find the alternatives available for the same product having equal or more quality. Infosys BigDataEdge performs Content Extraction from the test reports using GATE and JAPE. Then, Infosys BigDataEdge, created ontology, processed the RDF and obtained the output. Hadoop / Fuseki RDF SPARQL MapReduce Apache Jena SDB GATE JAPE Java Project Onsite Location: USA Project Offshore Location: Trivandrum, India Project Start: July 2013 Project End: During 2014 - Infosys BigDataEdge™ product implementation is in fast progress 59 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Infosys BigDataEdge™ serves AMP, USA by performing Customer Churn Analysis Infosys BigDataEdge is required to determine the reason for the Customer Churn happened to the company by analyzing previous Call Logs and Transactions. Infosys BigDataEdge is suggesting the optimal solution for enhancing the business by determining the best way to interact with the customers. These project activities are done by analyzing the call logs, transaction logs and other raw details available with the AMP, USA. Infosys BigDataEdge is focused on Building and executing the logic for the getting the optimal solution. Infosys BigDataEdge has created the Hadoop cluster for processing the data and executed performance tuning practices. Hadoop MapReduce Hive R Rhipe Java Project Onsite Location: USA Project Offshore Location: Trivandrum, India Project Start: February 2013 Project End: During 2014 Infosys BigDataEdge™ serves T-Mobile in project ‘T-Mobile Big Data Analytics’ T-mobile Big Data Analytics includes Extraction, Transformation and Loading data into Big Data Systems like Hadoop / Hbase. Data Source: Data comes from various sources like Application logs, Application Messages, Application Databases in different format. Project Objective: To extract, transform and load data to Hadoop based systems and perform analytics on it like CDR analytics, Bill Payment Fraud Detection, Customer insights etc. Hadoop Hive Pig Map Reduce Apache Drill Apache Whirr Apache Flume Oozie Cassandra MongoDB Hbase Amazon DynamoDB Ruby on Rails JAVA Spring JAVA C Project Onsite Location: USA Project Offshore Location: Pune, India Project Start: March 2013 60 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Project End: During 2014 - Infosys BigDataEdge™ product implementation is in fast progress Infosys BigDataEdge™ serves Australian Bank using ‘Online Transformation Program’ The Australian Bank required performing Analytics. Infosys BigDataEdge™ has enabled the Australian Bank in reading and analyzing past 5 years of historical data and generates legal reports. Australian Bank is enabled in answering products relevant data, which assists them in analyzing customer's interests and improvements necessary to the existing system. Infosys BigDataEdge™ Operation for the Australian Bank: Infosys BigDataEdge™ team is designing Big Data applications using Hadoop and its family of products. Hadoop - MapReduce Hive Sqoop HDFS Flume Platform: Ubuntu Legacy systems Target Systems: HDFS (RDBMS and XML web logs) Project Onsite Location: Australia Project Offshore Location: Hyderabad, India Project Start: May 2013 Project End: During 2014 - Infosys BigDataEdge™ Implementation is in fast progress Infosys BigDataEdge™ - India Operations Infosys BigDataEdge™ - Cloud and Big Data Practices from Pune, India BigDataEdge™ PoC: Infosys creates multiple PoCs using the BigDataEdge™ product and demonstrating them to existing customers. Retail PoC for Infosys customer: Most of the BigDataEdge™ PoC are relating to customers based on various criteria and finding their consolidated financial exposure, fraud detection using a machine learning algorithm, sentiment analysis from Twitter feeds, indexing a document repository and searching through it, analyzing customer data from multiple sources and suggesting the next best action to the business. 61 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Telecom PoC for Infosys customer: Infosys BigDataEdge™ created and submitted ‘Big Data Analytics PoC’ with highlights on production solution development. The details of this PoC are as follows, o Data Extraction from XML logs and loading into Hive, aggregate, load to HBase, REST implementation to access this data from HBase o Relating data from multiple sources and aggregating it to generate Tableau reports o Customer Clickstream traversal and related analytics using Web Logs o Funneling customers for targeting advertisements and promotions using Web Logs, Network Logs and Internal Warehouse Data. Currently, Infosys BigDataEdge™ is creating PoC for most of its telecom customers in USA. Infosys BigDataEdge™ product serves a Telecom customer Infosys BigDataEdge™ creates multiple PoCs for performing ETL from Web logs, traditional databases, application logs, CDR, etc. into Hadoop and NoSQL databases and performing analytics on it. Infosys BigDataEdge™ prepares Production Solution developed to extract data from web logs, network logs and traditional database system; relate this data and analyze it to provide insights to the customer regarding the market segmentation of their products and services, marketing spend efficiency and some more for targeting their advertisements and promotional campaigns better. Infosys BigDataEdge™ designs ingestion adapters to perform ETL from web logs and traditional databases into Hadoop file system. Infosys is writing Java code and Hive scripts for relating and analyzing data Infosys is creating reports using Tableau and HTML+JQuery and using Oozie for automation CoreJava Hadoop – MapReduce Hive Oozie HDFS Tableau HTML Javascript JQuery - Project Onsite Location: USA Project Offshore Location: Pune Project Start: February 2013 Project End: During 2014 - Infosys BigDataEdge™ Implementation is in fast progress 62 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Business Operations with BigDataEdge™ product: a. Infosys BigDataEdge™ is building Roadmap using BigDataEdge™ product b. Infosys BigDataEdge™ is identify features like Adapters, Components, Algorithms c. Infosys BigDataEdge™ is extensively performing 'Competition Analysis’ on Big Data Practice. d. Infosys BigDataEdge™ executes all its Proof of Concept process and forms its team as per customized project deliverables for its customers, in Big Data Practice. e. Infosys BigDataEdge™ creates Use case conceptualization f. Infosys BigDataEdge™ provides inputs to conceptualize Pricing Models g. Infosys BigDataEdge™ prepares ‘General Acceptance Audit’ h. Infosys BigDataEdge™ is intensively identifying Business Value for each and every client leveraging Statistical algorithms used in Big Data Technologies to deliver descriptive, diagnostic, predictive and prescriptive modeling. i. During March 2013, Infosys BigDataEdge™ has developed product on “Customer Service Analytics (CSRi)”, where its PoC is created with key Ratios computed using Statistics. 63 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. About Sales Intellect Company Sales Intellect Company is a Sales Intelligence and Business Research Company that provides ‘Sales Intelligence™’ company reports derived from Big Data/Data Sciences. Sales Intellect’s Sales Intelligence™ portal is a retail E-Commerce web-based application that provides a well-structured repository of 5 million Sales Intelligence™ company research reports derived from Big Data / Data Sciences, and altogether has 1 Billion Sales Intelligence™ company reports, across all Industry verticals and domains from all geographical regions. Sales Intellect serves through Sales Solutions, Solution Sales, Sales Services and Mobile Sales Intelligence™, for several leading Companies. Sales Intellect prepare Sales Intelligence™ for Customer Sales Team to increase Sales using 16 Sales Intelligence™ like Business Model, Business Architecture, Business Strategy & Planning, Business Initiatives and Opportunity Evaluation, Technology Intelligence, Project Management Intelligence, Product Intelligence, Global Services Projects, Information Technology Intelligence, Information Technology Roadmap, Information Technology Strategy, HR Intelligence, Financial Analysis and Marketing Intelligence. Awards: Sales Intellect Company has won awards like Red Herring Top 100 Asia finalist 2013, Stevie® Awards for Sales & Customer Service 2013, Rockstar of the Stevie® Awards 2013 and NASSCOM IP4Biz Award 2012 in INTEROP Mumbai 2012 and many awards / honours like Business Judge in 2013 Stevie Award for 'Women in Business', USA; Business Judge in 2013 Stevie Award for 'International Business Award', USA; Stevie Rockstar Award - Business Judge for 'Sales and Customer Service Award’ 2013, CII – Confederation of Indian Industry 2004, etc. Quick facts Sales Intelligence™ + Marketing Intelligence + Business Research + Human Intelligence, using Big Data = Sales Intellect Company. ‘Everything about Customers’ Prepare: Sales Intellect’s Sales Intelligence™ company reports For all Industry Verticals | Domains | Countries | Regions Mission: To provide Complete, Accurate, Reliability, and Timely Sales Intelligence. Vision: To increase Sales/Revenue of Companies using Sales Intelligence™ 1 Product cost: between US$ 10,000 to US$ 8 64 Sales Intelligence™ Report Infosys BigDataEdge™ “Everything about Customers”. Trademarks: Sales Intelligence™ Headquarters: Chennai, India Established: 12th April 2010 Contact Details Name: Mr. Churchill Dass Prince, Founder & CEO E-mail: churchill@salesintellect.net Corporate Headquarters: 2/86, 1st floor, Cisons Complex, Montieth Road, Chennai 600008, Tamil Nadu, India Mobile: +91-766-767-7775 and +91-766-776-6777 Phone: +91-44-28540100 USA Office Address: 701, Jackson Road, Silver Spring, Maryland, 20904, MD, USA Web-site: www.salesintellect.co & www.salesintellect.net 65