Innovations from Microsoft Research Asia ---------------------------------------------------------------------------------------------------------------------------------We should not be afraid of failure. If everything we did was successful, we wouldn’t be taking as many risks as we should and without taking risks, we would not innovate. ---------------------------------------------------------------------------------------------------------------------------------1.Digital Ink Owner:Jian Wang Contributor:Yu Zou, Zhile Wei, Dongmei Zhang, Shi Han and Ming Chang Digital Ink gives a tablet connected to a PC the ability to mimic the characteristics of ink on paper in a digital setting. It has all the advantages of using a real pen and paper with none of the drawbacks; users can write normal text, create a bullet point list, draw a diagram or build a table and the software will be able to discern the difference between each of them. It then becomes much easier to search through digital notes than those drafted on real paper, allowing the user to retrieve information much more quickly. 2. Speech Recognition and Synthesis (Speech Group) Owners: Frank Soong, Eric Chang Contributors: Yao Qian, Yi-Ning Chen, Lijuan Wang, Jianlai Zhou, Chao Huang, Terry Wang, Zheng Chen and Yu Shi. Our speech recognition engine, used for simplified and traditional Chinese, telephone or desktop speech recognition applications in MS products like Windows clients for accessibility, Chinese dictation in Office, Speech Servers, etc. is a giant leap in speech recognition. The basic acoustic units are custom designed to capture the unique, initial and final structure of tonal syllables in Mandarin. A highly accurate pitch tracking algorithm was developed to deal with the lexical tone nature of Mandarin speech. Both acoustic models and language models are trained to capture the intrinsic properties of Mandarin Chinese both in acoustic and language domains. 1 Our HMM-based Text-to-Speech (TTS) is developed for synthesizing highly intelligible and natural speech in different languages. HMM models of spectrum, pitch and voiced/unvoiced decision duration are first trained from collected speech databases. Then, based upon any given text, both segmental and prosodic parametric trajectories of speech parameters are synthesized in the maximum likelihood sense by the trained HMMs. Finally, high quality speech samples are generated by passing mixed excitations through time-varying, LPC filters. The HMM-based TTS has been tested successfully in more than two dozen languages, including non-tonal languages like English or French or tonal languages like Mandarin. It will be part of MS Exchange servers. 3. Chinese and Japanese IME (Input Method Editors) Chang-Ning Huang- Principle Consultant - (Natural Language Computing) Ming Zhou- Lead Researcher (Natural Language Computing) Owners: Ming Zhou, Chang Ning Huang Contributors: Jianfeng Gao, Kai-Fu Lee, Zheng Chen, Yijin Wang, Hong-Jiang Yue Zhang Zhang, Mu Li, Hua Wu, For years, language barriers prevented Asian users from fully enjoying the computer and the Internet. The Natural Language Computing Group at Microsoft Research Asia achieved significant breakthroughs and greatly improved users’ computer experience with the Chinese and Japanese input editor systems (IME). 4. Compound TCP (Transmission Control Protocol) Kun Tan- Researcher (Wireless and Networking ) Owner: Kun Tan Contributors: Jingmin Song Compound TCP (CTCP) is a congestion control algorithm for TCP to boost performance in such environments. CTCP increases the amount of data sent at a time by monitoring various parameters during transfer. Unlike many other high-speed TCP variances, CTCP ensures that its behavior does not have negative impacts on other applications. CTCP is now shipped in Windows Vista and Windows Server 2008. It is also available for earlier Windows OSes (XP and Server 2003) through hotfixes. 2 5. Halo Graphics Kun Zhou- Lead Researcher (Internet Graphics ) Owner: Kun Zhou Contributors: Xin Huang, Zhipeng Hu, Yaohua Hu, Xi Wang, Xinguo Liu, Minmin Gong Our highly rated graphics research has given gamers the chance to really enjoy the world of Halo III, having developed some core technologies for the highly acclaimed Xbox game that makes the virtual world appear more real. You can now better appreciate a constantly changing, realistic looking battle ground with UVAtlas, which gives the game the ability to texture map a 3D scene. Shield your eyes from a solar flare, created from our technology for realistic global illumination (Lightmap Compression). Watch the unbelievably realistic ripples and splashes cascade around you as you dive head first into the water, developed using the amazing system for modeling and rendering realistic water (River). Instantly feel the tickle of the grass on your face while hiding from an enemy with our technique for fast rendering realistic surface materials. 6. Entity Extraction (Web Data Management ) Ji-Rong Wen - Lead Researcher (Web Data Management) “In the next decade, I hope people can get all information they need through search technology.” Owner: Ji-Rong Wen Contributors: Zaiqing Nie, Ruihua Song, Yanfeng Sun Structured information about real-world products gets embedded in web pages and online databases. We developed a key technology to automatically classify, extract and integrate this information from the Web. With this, we can build powerful entity-level search engines, which enable people to get more accurate and clearer information for products in one visit, instead of browsing through a long list of pages. This technology was transferred to build Live Product Search and with this we believe we can make Live Product Search the largest product catalog in the world. 7. Relevance Verification Zheng Chen - Lead Researcher (Machine Learning) “If you are driven to innovate, want to work in an environment that allows you to leverage resources to influence tomorrow’s reality, and have fun at the same time; Microsoft Research Asia is the place for you to start on your path to success.” Owner: Zheng Chen Contributors: Tarek Najm, Ying Li, Li Li, Mingyu Wang, Benyu Zhang Editors of web sites bid for keywords that help direct users to their sites when search engines are used to search for products etc. Relevance verification looks at the keywords that editors want to bid for and checks them against the content of their 3 site, returning a yes/no result, letting the editor know whether the keyword they wish to bid for is relevant to their site and thus whether it is worth bidding for. 8. Cartoon.SDK (Internet Graphics ) Ying-Qing Xu - Lead Researcher “Insight is necessary at Microsoft Research Asia. Be it on an individual level or a team one, selfexamination and self-awareness are practiced regularly here to help improve the company as a whole.” Owner: Ying-Qing Xu Contributors: Lin Liang, Fang Wen, Jonathan Tien, Xin Zou, Yusheng Li, Qiufeng Yin, Harry Shum Cartoon.SDK is a fun technology that allows people to automatically generate vivid personalized facial cartoons from a photo of themselves. Giving users the ability to, say, create avatars for social networking web sites, this cool development has been successfully transferred into several Microsoft products, such as the Japanese version of MS Office, of which there were more than 17.4 million users in 2004. Microsoft has globally licensed this cartoon technology to several companies, and has been awarded the Microsoft Research Asia Best IP License Prize. 9. Interactive Computer Vision Jian Sun- Lead Researcher (Visual Computing ) Owner: Jian Sun Contributors: Yi Li, Lu Yuan, Chi-Kang Tang, Harry Shum Many computer vision applications still require a high-level of visual knowledge and an expertise that can currently be provided only by human input. With two products developed in this lab, we hope to change that. Using “Lazy Snapping”, even the average user can easily cut out an object of interest (e.g., portrait) in a photo and create a new scene with a different background; “Image Completion” is a tool for filling missing pixels or removing unwanted objects in a photo. With this, the user is able to effortlessly remove annoying objects or people in the photo in a visually plausible way, by simply drawing a few curves in the photo. What could be simpler? 10. Smart Thumbnail Xian-Sheng Hua- Lead Researcher (Internet Media) Owner: Xian-Sheng Hua Contributors: Fei Wang, Yong Wan, Hao Wei, Zhi-Jun Shi, Shipeng Li When your digital library gets a little too cluttered for its own good, Smart Video Thumbnail can help appease your anxiety and find what you are looking for without you ever breaking a sweat. The impressive video presentation technology helps users efficiently browse the video data in their digital library. Motion thumbnail provides a short synopsis of the original video, made up of a set of short high-quality and highly representative segments enabling users to glean more information about the video in a shorter period of 4 time. Static thumbnail, on the other hand, only takes one visually representative frame from a video, thus providing viewers with a rough idea of the content of the corresponding video in one image. Smart Video Thumbnail has been applied to many products, such as Live Search Video, Windows XP Media Center Edition, Windows Vista and MSN Video. 11. AutoMovie (Internet Media) Owner: Xian-Sheng Hua Contributors: Lie Lu, Yijin Wang, Yan-Feng Sun, Hong-Jiang Zhang AutoMovie is a five-star killer feature of Windows Movie Maker; it’s a one-click solution to home video editing! With home movie making becoming ever more popular, AutoMovie can help people take their own video footage and turn them into high quality films, with a soundtrack, without any hassle. AutoMovie automatically extracts highlights from your raw home videos and aligns them with music from your digital library, based on the content of the video and music. We developed an approach for examining the structure of your home video and chosen soundtrack, allowing the most relevant highlights and a matching rhythm of the music to be selected and aligned. In order to create more professional-looking results, the movie highlights can be satisfied by a set of editing rules and then matched to the content of the music, so that your footage is perfectly aligned with your tunes! 5