An Agent Capable of Learning to Create and Maintain Websites Anthony Tomasic, Ravi Mosur Alex Rudnicky, Raj Reddy, John Zimmerman Carnegie Mellon University 18 April 2003 Outline • Project vision • Problem – Assumptions – Inputs – Outputs • • • • • Missing capabilities Our approach Impact Evaluation Conclusion Project Vision – Honeydew • Honey – An agent – Learns by observation – Obtains advice and consent – Creates and maintains public project websites Project Vision – Honeydew • Honey Can Perform Tasks – Organize, manage and update a complex project website – Delegate tasks – Generate periodic briefing folders • Email extracted material and online documents with planning and summarization capabilities – Respond to specific information queries – Extract relevant information • WWW & mailing lists – Perform teaching – Communicate with other Honeys and EPCAs Problem Assumptions • Scope of a “project” website is predefined and not learnt: – – – – – – – – – Publications Presentations Milestone status News updates Demonstrations Links to collaborators Software releases Documentation FAQs Problem Assumptions • Things that Honey will not do – System administration • Capacity planning – Graphic design • Font selection • Site design • Some layout design possible … – Content creation Inputs to Honeydew • E-mail messages with updates to website – Volunteered and solicited information • Minutes from project meetings – Tracking project participants and events • Queries from external sources – Inferred information needs – Click sequences • Publicly visible events, not explicitly provided to webmaster – Conference appearances, news stories, etc. Inputs to Honeydew • Sequences of UI actions performed – Receive e-mail request to add paper to WWW site – Extract title, author, abstract, publication forum, funding agency – Think up file name – Copy attachment to conference paper directory – Update WWW page with info and link – Notify user of change – React to advice from user about change Expected Outputs • Project websites (5 subprojects, 1 project) • Report generation – Overviews of activity over time (e.g., quarterly reports) • Briefing generation – Overviews of current project activities • Question-answering agents – Google-like search, summarization in response to specific queries • Semi-automatic FAQ generation Expected Outputs • • • • • • Shared knowledge base of learned tasks Toolkit for rapid construction of new assistants Assistant Monitoring and Interaction GUI Requirements for “Assistant Aware Applications” Stream of papers Stream of masters and Ph.D. students Missing Capabilities – What • Identify significant webmastering events • Represent webmaster activities through generalizable descriptions • Create consistent and complete task representations • Formulate key clarification dialogs • Adapt to errors in task execution Our Approach • Ethnographic study of Webmasters • WoZ system for domain definition • Human webmasters with Honey observing activities • Information-sharing among EPCAs Our Approach • • • • In-line data labeling by humans Interactive clarification of human actions Lightly-supervised learning Generalizable and sharable representations of activities • Learn by being told • Enthnographic study of Honey users Impact • Relieve the user of routine maintenance tasks associated with web pages • Illustrate portability by using in other WWW task domains – HCII web page – Pittsburgh Post Gazette web page – Workflow systems Impact • Dramatic reduction of human effort in construction and maintenance of web sites – Improve time productivity by 100 to 500% – Little or no loss in quality of site • EPCAs that can learn the skill of cooperatively structuring and managing information • Dramatic reduction in the cost of construction of assistants – Reduce size of backroom knowledge engineers • Assistants become trainers of new Web masters Evaluation: mid-term and finals • Honey performance to be compared with 5 human subjects – 5 other human coaches to be used in providing data and inputs needed for the Honey to learn from Experience – Operational Version 0 in 3 months • Evaluation – 6 months – Honey performs 50% of tasks correctly • Evaluation – 12 months – – – – Elapsed time from e-mail to WWW update improved by 1.3 Elapsed time to assemble report improved by 1.3 Honey performs 75% of tasks correctly Quality of WWW site and reports comparable to human Infrastructure Input • Users • Architecture • Functional Specification • Text – Filtering – Summarization – Extraction • Quality Assurance • Quality Assurance • Evaluation • Knowledge Representation • Dialog Conclusion • Key emphasis – Learning – Coaching – Retargetting • • • • Potential huge impact to Webmaster job Large amount of shared infrastructure Many similar problem domains Many, many research problems Research Issue • Architecture Execution Recommend Learning User GUI Coaching GUI KB GUI Monitor/Do GUI Matching Summarization KB Event Stream Monitor/Do E-Mail Server E-Mail GUI Editor File System Command