The four parts of a URL This presentation is on URLs. URL sounds

advertisement
The four parts of a
URL
URLs – uniform resource locators
IT skills: compose URLs
IT concepts: parts of a URL, shortened URLs,
domain names, directories, file names
This work is licensed under a Creative Commons Attribution-NoncommercialShare Alike 3.0 License.
This presentation is on URLs. URL sounds
exotic – it stands for universal resource locator
– but it is just a geek term for a Web address. A
URL is the address of a resource on the Web.
You’ve typed URLs into Web browsers many
times, but may not have thought about the
meanings of each of the four parts of a URL.
We will review them and also mention
shortened URLs.
In order to understand the parts of a URL, we
will also cover domain names, directories and
file names.
Communication
technology
URLs are a technical topic, but understanding
them will also help you use the Web, so we
include this presentation at two places in our
class outline.
Where does this topic fit?
• Internet concepts
– Applications
– Implications
– Technology
• Internet skills
– Application development
– Content creation
– User skills
URL – a Web
address
URL, uniform resource locator
http://som.csudh.edu/fac/lpress/shortbio.htm
To retrieve a page from a Web site, you type its
URL into the box at the top of the browser and
hit enter. The browser then retrieves the page
from the server and displays it. URL is just a
fancy way to say the Web address of the page.
request
page
Client
Server
In this example, a short professional biography
that is stored at the listed URL is retrieved and
displayed.
The four parts of a
URl
Four parts of the URL
http://som.csudh.edu/fac/lpress/shortbio.htm
If you look at URLs, you see that they can be
broken down into four parts. Note that some of
the parts may be omitted at times – we’ll talk
about that later. For now, let’s take a look at
each of the four parts.
http://som.csudh.edu/fac/lpress/shortbio.htm
http://som.csudh.edu/fac/lpress/shortbio.htm
http://som.csudh.edu/fac/lpress/shortbio.htm
Part 1 – this is a
Web page
http://
http://som.csudh.edu/fac/lpress/shortbio.htm
Part 2 – the domain
name of the server
The domain name
http://som.csudh.edu/fac/lpress/shortbio.htm
The URL prefix “http://” indicates that this is
the locator for a Web document that should be
retrieved and displayed. It turns out that Web
browsers can also function as clients for other
services like file transfer, but that is seldom
done. For that reason, one can often omit the
“http://” since most browsers will plug it in for
you.
The next part of our example URL reads
“som.csudh.edu”. That is the domain name of
the server. Domain name is another fancy geek
word – it is the name of the computer the server
program is running on. We will say more about
domain names in another presentation. For
now, it’s enough to say that the domain name
identifies a unique computer on the Internet.
Part 3 – the
directory the page is
in on the server
The directory
http://som.csudh.edu/fac/lpress/shortbio.htm
root
fac
staff
* lpress
jsmith
admin
...
The next part of the URL is the directory or
folder containing the page we are looking for.
Every computer has many files in storage. We
would quickly become confused in trying to
locate a single file in a list of say 100,000
names. For that reason, we organize our storage
into hierarchical directories, sub-directories,
sub-sub-directories and so forth.
jdoe
Here we have three sub-directories – for staff,
faculty and administrators. Since I am a faculty
member, my directory, named “lpress,” is a
subdirectory of the fac directory. Each faculty
member – jsmith, jdoe, and the rest would have
directories in the fac directory.
Part 4 – the name of
the file on the server
The file name
http://som.csudh.edu/fac/lpress/shortbio.htm
root
staff
lpress
fac
jsmith
The final part of the URL is the name of the file
we wish to retrieve, in this case “shortbio.htm.”
Since shortbio.htm is in the lpress directory, the
Web server will send that file back to the client,
which will display it. The user can then read
my short biography.
admin
...
jdoe
shortbio.htm
Shortened URLs
URLs may be shortened
http://www.csudh.edu/studentaffairs/financialaid/faq.
shtml#apply_for_scholarships
http://bit.ly/djeREW
(I shortened the URL using the service at http://bit.ly. There
are others that do the same thing.)
Note that the file name has two parts separated
by a dot. The suffix or “extension” indicates
what type of file it is. The suffix “htm” means
that this is an HTML file, in other words a Web
page. (We will say more about HTML later).
Let’s look at a couple more things concerning
URLs.
For a start, you can save space by shortening a
URL. If you try them out, you will see that both
of the links shown here are to the same page.
There are many URL shortening services on the
Internet. I used one called bit.ly to create this
example.
Shortened URLs can
be risky.
Know the source of a shortened URL
Be cautious when clicking on shortened URLs.
Shortened URLs hide the actual domain name,
so you cannot tell where they link to by looking
at them.
http://bit.ly/9dLHk5
Click the link shown here to see why you
should only go to shortened URLs from trusted
sources.
Most Web clients
assume http:// by
default
You can delete the http://
As mentioned previously, if you delete the
http://, the client will assume the message is for
a Web server and request the page as if the
http:// had been included.
som.csudh.edu/fac/lpress/shortbio.htm
Default file names
You can delete the file name
http://bpastudio.csudh.edu/fac/lpress/
The Web server will look for a file with a default name, which
is specified by the server administrator.
Traditional default names are index.htm, Index.html,
default.htm, and default.html.
If you delete the file name, the Web server will
look for a file with a default name, which is
specified by the server administrator.
Traditional default names are index.htm,
Index.html, default.htm, and default.html.
Summary
URL summary
http://som.csudh.edu/fac/lpress/shortbio.htm
This request is for a Web (http) server
The server program is running on a computer with the domain
name som.csudh.edu
Look in a subdirectory called fac/lpress
If there is a file called shortbio.htm, send it back to the client to
be displayed; if not, send an error message back to the client
Domain name = the unique name of a computer on the Internet
Self-study questions
1. Without looking back, can you describe the four parts of a URL
and explain what each means?
2. What happens if you misspell the file name in a URL?
3. What happens if you misspell the domain name in a URL?
4. Can you omit the domain name?
5. What happens if a URL shortening service like bit.ly goes out
of business?
Here we summarize the meaning of the four
parts of a URL. In addition to learning about
the parts of a URL, we saw that they may be
shortened using a service like Bit.ly and that
some parts of the URL were optional.
We also learned about domain names,
directories and file names.
Download