Report #3
What are web search engines?

Table of Contents
Report #3
What is a search engine?
My first experience using
a search engine
What prior generations had to say
Yahoo...
Webcrawler...
Helpful hints
Using Yahoo and Webcrawler
What I've learned from search engines



What is a search engine...


A search engine is a tool used for searching for a particular subject or topic, or any word for that matter. It can be compared to a library system (UH carl, card catalog, etc.), but I think it is a whole lot easier and has a whole lot more information readily available in a quick amount of time. Several databases have been created which make up these search engines. Some names of search engines are Yahoo, Lycos, Infoseek, Webcrawler, Dejanews, Inktomi, and Opentext. These databases are used to search web pages and/or usegroups.

Generally, the search engine companies like Yahoo, Lycos, etc. set up databases that are located on their server as indexes. The databases are created or upkept through devices called demons or robots. The demons or robots automatically roam the DNS entries of the web pages and/or usegroups and collect the data that can be searched. The CGI is a programming code that interacts with the server and is responsible for doing the searching.

Each search engine is different in how and what kind of information they look for. There are search engines that are considered general purpose and some that are special purpose collections. Some search engines index the entire page, some only index titles and URL addresses. Others search through the use of approved categories. There are also business directories that are organized by approved fields that can be searched.

Therefore, to access information on the Internet about a particular topic is made very simple by just clicking on a particular search engine and typing in the topic you are interested in. Depending on which search engine you use, an index of pages or categories that may be related to your topic are shown in the results.



My first experiences with search engines...


For some reason the computer that I was using just wouldn't let me get into net search and I began to get frustrated. The computer would pop up with an error message and shut down netscape all together and restart the computer. I had to wait everytime for the computer to restart and load Netscape again. I finally quit and went to the CLIC lab. I decided that I would start with a search on cyber-community since it was part of our reports. I just typed the word in and pressed SEARCH, I didn't even bother to take a look at which search engine I was using. I was just happy to find a list of pages that were related to cybercommunities. I quickly clicked on the first link that appeared.

The only thing that I was unhappy with was the fact that it was taking so long to get into the net search. The search itself for the topic was really quick. There were only about 5 related pages that were found. So, I clicked on each page and browsed through and was generally pleased at how easy the whole process was. Then I decided that I would try a different search engine and also some different topics. I went back to net search and tried Yahoo. I searched about a topic I'm doing research on in another class. The topic was inclusion in public schools. The screen said "Please use a non-empty search string". I was lost. What is a search string? I only typed in one word, what was wrong? So, I just tried again. I typed in inclusion one more time and hit Search. Well, this time I came up with 33 matches. Weird, I thought. I just kept on going. The results were listed by the categories that they came from in the index and I thought that was unnecessary. I was generally pleased with the results and found the majority of them to be relevant to what I was studying.

On the most part, I thought that my practice with search engines went very well. I found information that I was looking for and didn't run into major complications. The only thing is that it sometimes may take a while for you to find the actual resource that you want. It takes a few tries sometimes to figure out what words to type in to do searches. I found a few pages that shows how to get the most out of your searches and which search engine to use. There is a great table that I found rating some of the most popular search engines and giving you the pros and cons and what it searches. I think this might be a good place to start on finding out information on how to do worthwhile searches.


What Prior Generations Had to Say.....

For the most part, I found that prior generations found that using search engines was a positive experience. They pretty much shared the same views as I did. They found it to be really easy, except for the large results. It required a little more effort to find a way to reduce the large amount of results, but the majority expressed positive feelings about search engines. I pulled a few quotes from some students:

Ragna Borries "I learned that it is necessary to carefully select the links that you choose to work with."
Curtis Nakao"There were no real problems or negative emotions to speak of."
Cherish Higashi" I found that by using this summary option, I was able to save a lot of time choosing to save the appropriate listings".



For the next part of this report I am going to select two different search engines (that are for different purposes and use different approaches) and try to explain how each one works. The two that I have chosen are Yahoo and Webcrawler.

YAHOO...


Yahoo! is a search engine made up of a hierarchical index. The web pages are set up into organized categories. Yahoo! supports simple Boolean searching. What makes Yahoo! different from Webcrawler is its hierarchical menu. Yahoo! searches categories, titles, and comments to find listings that have the keyword that was typed in for the search. Once it has found 100 matches, it will stop, which is the default setting. It is possible to change these defaults by selecting the key that says options next to the search box.

The Yahoo! search gets three different types of information. It will find categories of Yahoo!, end-sites, and the categories that match the end-sites which are listed. This is useful because you can either go to the end-sites or browse in the category in which it came from for other relevant sites that may not have been brought up by the search.

Yahoo! obtains its links in two fashions: the first way is through user submission, in which a creator of a web page actually submits and registers his/her page with Yahoo!. The second, alternative way that Yahoo! gathers links is through the automated robots that look out for new links at pages that are already registered.

Yahoo! has a new search engine. When you are at their home page, to the right you will see a link to a "new search". This is supposed to be an updated, more efficient way of searching with Yahoo!.I tried this search and found it to be a little more user friendly. For detailed information, see their help page for the new search features. Some added features include; relevance scoring on results and new query options for more efficient searching.

I find Yahoo! to be the best search engine to use while looking up general categories and topics. It is easy to look through the specified categories and narrow down to specific areas of interest.


WEBCRAWLER...


Webcrawler is one of the major search engines that use the demon searching approach. It begins with a few select number of documents or pages and through these pages, is able to identify new places by tracking and exploring the outgoing links from the original documents. This is where it's name comes from. It starts out small and grows and grows, layer after layer, feeding from the previous layer.

Webcrawler has a limited search language, and allows only some aspects of Boolean searching. Webcrawler is a general purpose search engine and it indexes only titles and URL addresses, as opposed to whole pages, as does by Alta Vista and Lycos.

Once the results are ready, Webcrawler displays its results in a list with a score that represents the relevancy to the topic searched. One good thing about Webcrawler is that it offers you a chance to display your results by titles only or with a summary. The summary is a good way to weed out unrelated links. Sometimes there are links that show up that are totally off the subject and it saves time to just skip right over it. There is also a place to specify the number of results you want to be listed. Another useful tool from Webcrawler is the feature is called the " link-back search method". It allows you to see who's linking to your Web site or to anyone else's. For example, while you are a page that you happen to like, you can click on the button at the top of the page that will access the "Surf the Web backwards link-back feature". Here, you can type in an address, and it will then show you the pages that are linked to it.

I think it is important to learn about all the search engines and know how to use it to your advantage. I came across some pages that were helpful to me in learning more about search engines:
The Search is Over
This is a good guide to search engines. They provide a brief overview of the major search engines and a great table that provides the pros and cons of these engines.

Webreference: Search Engines
This site explains what search engines are, how they work, and offer suggestions on how to get the most out of them.

Searching the Internet
This site offers a number of useful links to pages that are related to search engines. This is where I found most of my links for this report. It also links up to many search engines.

Diane Nahl's Favorite Search Sites
Here's a great page that has all the links to all the search engines and some links to some useful pages too.



Using Yahoo! and Webcrawler...

I chose the word "cyber-psychology" for my search on these two pages since I needed to gather information for my next report. I first started with Webcrawler. It didn't take long for the results to be shown. There were four sites that resulted from my search. That's it, I thought. I did get the Home page of Dr. James, where I got the word, cyber-psychology from so I was pleased with that.

Next, I tried the search in Yahoo!. Again the search was quick and now I was given eight matches. I looked over these links and found that none of them were the same pages that I found with my search using Webcrawler. Interesting, I thought. With closer investigation, I found that of the eight sites listed, two were repeats under different categories, therefore, really only listing six pages. Of these six, Dr. James' report or home page was not listed and that seemed to disappoint me. I thought for sure I would get his page because the word, "cyber-psychology", appears in both the text and the title.

Next I tried to search webcrawler again by hitting on the link at the bottom of Yahoo!, which is quite convenient. Yahoo! will do a search for you in a number of the other search engines for the same word. In other words, you don't have to go through the process of typing in the keywords and starting from the beginning again. It's just convenient. Well, I got the same four hits again as I did the first time around. At least they were consistent.

I repeated this whole process again a few minutes ago while writing this report and came up with the same results, exact same pages and same number. I guess no new links or registrations into these engines about cyber-psychology in the past two weeks.



What I have learned from using search engines....


My beginnings with search engines tried my patience and I have discovered that I hate waiting in front of a computer screen while things load. It didn't matter how interesting or important I thought the page was going to be, I couldn't help but press the ol' stop button and move on with something new. I have found better ways to do searches. When I first started, I did searches that resulted in tens of thousands of pages that were relevant. There was no way that I could sift through all of that. I began reading the help pages and found better ways of inputting key words and how to use Boolean searching.

I have found that the Web has everything and anything that you want to look for. There is a vast amount of information, useless and relevant. It is addicting and exciting. You can be searching for something is particular and run across something totally off the subject and end up looking at links to those for hours. It keeps the mind stimulated as I think of it. When I am doing searches, I keep jumping to new ideas and new information that looks interesting. I'm learning about all kinds of things that I would have never sat in the library looking at.

My biggest pet peeve of the Internet is the time that it takes for pages to load up. I'll say it again, I hate it. I just press that lovely stop key and move right on. So, if you're page happens to take more that 10 or 15 seconds, sorry, but I can't wait for you. There's way more information out there to be looking at.

So far I have used the Internet for communication purposes, mostly for this class, but now I write and communicate with friends who have graduated and don't get a chance to see as often. I also use it for education and research. I am involved in a class that does a lot of problem-based learning and requires me to research many topics. The internet is handy and quick resource to many references. And lastly, the Internet is used for entertainment. Almost everything is on the Internet and can be looked up. It's like watching T.V., just requires a little more effort.

My Home Page Report 1 Report 2 Report 3 Report 4 Instructor's
Home Page
E-mail Me