|
The link above
is to my work web page at Carnegie Mellon.
You can actually view the structure that exists within any web
page. Use the “VIEW” pull-down menu on
your browser and then choose “Source.”
The first code recognized by a computer would be the “<HTML>”
tag that you’ll see about five lines down (after a bunch of remarks that the
computer ignores). This would be
equivalent to a document type – if you’ve ever done an advanced Google or
Yahoo search, you’ll notice that you can restrict your search to a type of
document. Right after the
“<HTML>” tag you’ll see the beginning of the “<HEAD>” of the
document. The chief portion of the
head of an html document is the “<TITLE>.” Note that I put a bunch of different
variants of my name in the title of my page.
I did that so that if someone looks for me as “Matt Marsteller” or
“Matthew Marsteller” or “Matthew R. Marsteller,” then when this page is
indexed by most search engines, there’s a good chance that most search
engines give greater weight to the title of a web page. The title of a web page is what shows up in
the blue bar at the top of your browser window. The web page also has “Web Page of Matthew
R. Marsteller” in big bold letters. If
you look in the source code (about a third of the way through the code),
you’ll see that this phrase is enclosed in “<H1>” and </H1>”
tags. This is a heading within a
document and can also be given more prominence by a search engine. The key operative phrase is “can be given
more prominence” – nothing is guaranteed.
When search engines rank their search results, a lot of things come
into play. Ranking algorithms (rules
the computer will follow) are often redesigned. We’ll explore searching of the Internet at
the end of our study together. For
now, I hope that you’ll appreciate that the typical web page on the Internet
can, and usually does, have structure that search engines can make use
of. Perhaps it’s a bit more crude than
we’re used to … but it’s there! There
are other concerns such as … is it standard HTML code? The answer is, unfortunately, no. One thing that is popular for people to do
is to use a word processor like Microsoft Word to produce their HTML code
(using the “SAVE As” function). It’s
easy to do, but the result is NOT standard HTML. Most browsers will interpret it correctly
(kind of) – hopefully most search engines will as well, eh?
|