Willy Vasquez Rising Senior at MIT › Studying Computer Science and Engineering › Research with Shafi Goldwasser › Intern at Symantec Mobility Management Group Work of Christopher Domas of the Battelle Memorial Institute Brief overview of his talk at REcon › The Future of RE: Dynamic Binary Visualization The goal is to answer “what is this and what does it do?” Lots of time to identify patterns Finding the patterns is an art. Taking a computationally difficult task and translating it to a problem our brains naturally do Traversing thousands of lines of hex and making sense of it in 20 seconds Steganography Obfuscation Embedded Devices Unknown formats Our current best RE tools are completely dependent on known structure Gates’ Law › Software is getting slower more rapidly than hardware becomes faster › Amount of Information we need to analyze is growing exponentially Greg Conti › US Military Academy › Blackhat Aldo Cortesi › Nullcube › corte.si Even in unstructured data there are relationships, especially among local hex bytes Digraphs Ascii Image Audio Mapping data to Hilbert curves Goal: Understanding data independent of format Named after Georg Cantor Works off of emphasizing the idea of relationships between binary information Bayesion Method to classify certain types of formats Current binary parsing › Recursive descent: IDA style that follows patterns and calls in code › Linear sweep: objdump and goes through in linear fashion Rely on a structures grammar ..cantor.dust.. Uses probabilistic parsing, which does not rely on grammar A new way to look at binary information Can find demo from blackhat presentation: https://media.blackhat.com/bh-us12/Arsenal/Domas/_cantor.dust_.7z.zip No updates since last summer The full talk and slides located on the recon.cx website: › http://recon.cx/2013/schedule/events/20.ht ml