Fun Fun Project One Building Your Very Own Web Server Fun Fun Project One 1 What is a Web Server? Program that understands the HTTP protocol and generates appropriate responses Clients “connect” to the machine Clients send a “request” Server reads request, generates “response” Client interprets response appropriately Fun Fun Project One 2 A Simplified Web Server Client asks for file Server finds appropriate file Server sends back a response header followed by the requested file’s data Server closes connection Fun Fun Project One 3 What Does “Connect” Mean? For all practical purposes, it looks like there’s data available via a file descriptor Stream of bytes Can be treated like any other file descriptor Not a FILE * (like stdio, stderr) Must use read() and write() system calls Fun Fun Project One 4 How Do You Identify Machines Domain names/IP address and ports http://www.cs.princeton.edu implies a machine named www.cs.princeton.edu and a default port of 80 http://127.0.0.1:8080/index.html Refers to local box (127.0.0.1 is me) Port # is 8080 (used for this project) File is named index.html Fun Fun Project One 5 How Do You Identify Files? File name is specified in Request Message Server maps that name to a real file Mapping can be done in whichever way server wants For example, /~vivek/index.html is actually /n/fs/fac/vivek/public_html/index.html In your web server, you can choose your own Fun Fun Project One 6 What’s In A Request Message? GET /index.html HTTP/1.0\r\n Connection: Keep-Alive\r\n User-Agent: Mozilla/4.72 [en] (X11..)\r\n Host: 127.0.0.1:31415\r\n Accept: image/gif, image/jpeg, */*\r\n Accept-Encoding: gzip\r\n Accept-Language: en\r\n Accept-Charset: iso-8859-1,*,utf-8\r\n \r\n Fun Fun Project One 7 What Do You Care About? GET /index.html HTTP/1.0 In particular, just index.html Assume “/” means “/index.html” Fun Fun Project One 8 What Could They Want? An honest-to-goodness file (me.jpg) An indirect request for such a file (such as “/” meaning index.html) An implied directory with index (/home/vivek instead of /home/vivek/) Just a directory listing A query (we don’t care about these) An invalid/nonexistent file Fun Fun Project One 9 What’s In A Response Message? HTTP/1.0 200 OK\r\n Date: blah-blah-blah\r\n Server: blah-blah-blah\r\n Content-Type: important\r\n Content-Length: 12345\r\n Last-Modified: blah-blah-blah\r\n \r\n Raw data Fun Fun Project One 10 What’s a Minimal Response? HTTP/1.0 200 OK\r\n Content-Type: stuff\r\n \r\n Data HTTP/1.0 302 Moved\r\n Location: newurl\r\n \r\n HTTP/1.0 404 Not Found\r\n \r\n But also Connection: close\r\n Content-Length: yyy\r\n Fun Fun Project One 11 Response when… File exists? Send it Directory without “/” suffix? Redirect Directory with index.html? Send it Directory with no index.html? List it For each list entry, add “/” if needed Failure(Not Found)? Bad Request? Fun Fun Project One Send 404 Send 400 12 How to Test Your Server? Use a browser(Netscape/IE) Use “wget” Support HTTP protocol http://www.gnu.org/manual/wget create directory hierarchy for retrieving Include some big images Fun Fun Project One 13 More Test Cases What if Request Message is not send/received in one packet… The server must read all the Request Messages before it gives any response message Remember the double carriage return and line feed? Your web server must consider this! I’ll distribute more test programs later on, check http://www.cs.princeton.edu/~yongwang/cos318 Fun Fun Project One 14 What is Content-Type? text/html image/gif image/jpeg (Other types not needed for project 1.) Fun Fun Project One 15 Need more info? HTTP 1.1 Specification – RFC2068 HTTP 1.0 – RFC 1945 man pages man man man –k blah read( ), write( ), open( ), close( ) Fun Fun Project One 16 Why open instead of fopen? Compare fopen, fread, etc., with open, read, etc We’re dealing with functions closer to the OS – easier to use in some cases Fun Fun Project One 17 What’s a File Descriptor? Sort of like a FILE * It’s an integer provided by OS Used to represent a stream of bytes Can represent file or network connection Behavior is slightly different Especially when reading/writing network Fun Fun Project One 18 General Steps Setup, and then Get next connection If file, read from disk If directory, generate listing Send all to client Close connection, wait for next one(nonpersistent connection) Fun Fun Project One 19 What Am I Given? Setup function Accept function Gets a connection from a client File type function Makes server available for connections Tells you what kind of file, if it exists Tells you how many bytes if a regular file Directory listing functions Gives you the file names one at a time Fun Fun Project One 20 Help! I’m Lost! Don’t know HTML? Use Netscape composer to see what to do View page source for various pages Do “telnet www.domain.com 80” and issue the GET manually (need to add “Host: www.domain.com” header) Ask Fun Fun Project One 21 Why Are We Doing This? Infrastructure for future projects Some OS/Networking interaction It’s fun, and not too bad Fun Fun Project One 22 Wrap Up Thanks! Q&A session next week! Office Hour: Wed 4:30-5:30 Fun Fun Project One 23