Discussion Board
Go to the previous messageGo to the following message
Current Forum: Homework 5 General Forum
Date: Sun Nov 4 2001 9:49 am
Author: Lee, Peter <petel@cmu.edu>
Subject: Re: posting extra credit information

Oops. In the previous post, there were several formatting problems because Blackboard interpreted the html. Here is the entire message again:

======

Yes, we're working on the extra-credit info right now. If you or anyone else has ideas for improvements, you should feel free to post them. We will respond with an indication of extra-credit (or not).

Here are just a few of the kinds of things you might consider, focused on part 1:

- Suppose you have a web page at http://www.foo.com, and in this web page there is a passage that goes like this:

...documentation is available on <a href="http://www.bar.com">how to build an invisibility potion</a>...

Normally, the keywords such as "invisibility" and "potion" would be associated with www.foo.com. But in essence they should probably be associated with www.bar.com instead.

How would you have to change the structure of the search engine to do this?

- Special characters (eg, foreign characters, accented characters, etc) in a web document are indicated by special keywords that begin with the ampersand ("&") character and terminated with the semicolon (";"). So, for example, an incompressible space is indicated by " ". At the moment, the ampersand is treated as though it is whitespace, not a special keyword delimiter. As a result, keywords such as nbsp are indexed, when probably they ought to be ignored.

How would you change Part 1 so that these special formatting keywords are ignored?

- One special kind of tag in web documents is the "META" tag. For example, a web page for a very interesting course on data structures and algorithms might include meta tags like the following:

<meta name="keywords" content="trees, graphs, big-O, fun homeworks">
<meta content="The home for the 15-211 course at Carnegie Mellon University">

These meta tags don't get displayed by the web browser --- they are present only to "help" search engines do better indexing. The current Part 1, however, completely ignores meta tags.

How might you improve Part 1 so that meta tags are taken into account by the search engine?
Post response

Go to the previous messageGo to the following message
Current Thread Detail:
posting extra credit information      Liu, Limin Angela      Sat Nov 3 2001 8:22 pm       
Re: posting extra credit information      Lee, Peter      Sun Nov 4 2001 9:43 am       
Re: posting extra credit informa...      Lee, Peter      Sun Nov 4 2001 9:49 am       
Re: posting extra credit info...      Liu, Limin Angela      Tue Nov 6 2001 11:00 am       
Re: posting extra credit i...      Lee, Peter      Tue Nov 6 2001 1:04 pm       
Re: posting extra credi...      Shi, Ying      Tue Nov 6 2001 2:02 pm       
Re: posting extra cr...      Liu, Limin Angela      Tue Nov 6 2001 2:06 pm       
Re: posting extra...      Cipriani, Jason A.      Wed Nov 7 2001 8:34 pm       

Back to previous screen