From Now On
Vol 7|No 4|January|1998
Searching for the Grail?
Power Searching with Digital Logic
by Jamie McKenzie
No sense sending students and colleagues out to search the Internet or a CD-ROM encyclopedia unless they possess a toolkit of powerful search strategies to speed them past Info-Glut and Info-Garbage to the very information they need.
Power Searching cuts past what David Shenk calls Data Smog.
Learn these ten strategies and you are on your way to information which is pertinent, cogent and worth saving. Converting such information into Insight requires a different set of strategies.
Think and draw before you plunge. Make a list of great questions.
Take advantage of mindware like Inspiration to create a cluster diagram with several dozen concepts, keywords and telling questions which will be powerfully instrumental later when executing your search.
You ask your students to work as teams to propose a mix of laws and government programs which would serve to protect old growth forests while restoring the vitality of this nation's timber industry.
You send them to one particularly intriguing article to start them thinking.
You ask them to work in teams to create cluster diagrams listing as many pertinent concepts and keywords as possible.
Pick the best search engine. Bookmark it.
Stay with that search engine until someone invents a faster and better information trap (which may not take long).
Do not rely upon the "search" buttons built into Netscape or Explorer. These lists create advertising revenue for the software companies and the placement of search engines is a function of revenue rather than value.
How do you pick a good one? Two basic strategies:
No matter which search engine you select, do not settle for the "simple search" version of any. You are sacrificing power and accuracy for ease of use. It's a bit like training wheels on a bike. Go for the Power Search or the Advanced Search and encourage your students to do the same. They offer features which will vastly improve your results when combined with the suggestion in this article.
The more powerful the search engine, the more important the syntax - the rules governing how you enter your search query. Because few people stop to read and learn these rules, they end up with crude and clumsy searches.
For example, some search engines care about CAPITAL LETTERS and punctuation. Others ignore them both. If you search for Washington, D.C. with the following query, you may achieve no "hits" with one search engine and thousands with another . . .
Another example . . . when you want an exact phrase such as "old growth forest," some search engines require quotation marks around the words which belong together, while others do not care.
Another example . . . when you are conducting Boolean searches, some search engines require that you capitalize AND, OR and NOT. Failure to do so may convince your search engine to ignore these critical words.
You can usually find the syntax for any search engine in the Help pages. If not, you may want to look for an engine that explains its rules.
As Info-Glut has grown to be more and more of a problem, the search engines have competed fiercely to offer the best tools to support you in your sorting and sifting, and yet I have witnessed hundreds of folks ignoring these powerful extra tools and features.
Altavista's Advanced Search offers important extra features:
HotBot's Super Search allows you to request particular domains, particular dates, particular levels of a Web site, particular countries of origin, particular types of files and as many as 100 results at a time. These can be very helpful search features.
The Power Searcher explores all of the features of a chosen search engine in advance of real searching in order to apply these extra tools with skill when they are needed.
Effective searching requires a balance between a broad reach and a careful aim. The searcher must cast a net far enough to capture the most important information, and then, once safely contained, must cull the results so that only the best information remains.
Our first search for "old growth forest" produces more than 5,000 hits, many of which are irrelevant. When we REFINE SEARCH as described above, we see that these 5600 hits fall into major categories such AS forest, habitat, ecosystem, lands, conservation, wildlife, etc. Since we are just beginning our exploration, we may want to conduct fresh searches for each of these concepts and use the REFINE SEARCH function once again to see what other ideas, issues and possibilities are associated with our original search phrase.
Too many searchers narrow their search prematurely, thereby condemning themselves to the boundaries and ideas of their prior knowledge.
Most of us do not understand the forest of our question well enough at this stage to start looking at individual trees and bark and leaves.
This is the time to map out the territory into conceptual zones, each of which becomes a neighborhood worthy of more carefully focused exploration.
If we have little prior knowledge of "sustainability" we could not know what we do not know and could not plan or pose the questions we ought to be exploring. We might learn some before we focus narrowly and sharply.
Our search on Altavista provides us with a much richer appreciation of the subtopics associated with sustainability and delivers a list of key words which will direct the more pointed search process.
About 46779 documents match your query.
stewardship - renewable - impacts - indicators
Each of these is a rich vein worthy of further investigation.
Early search efforts are meant to provide an overview of the information landscape relevant to the investigation at hand, much like petroleum prospectors flying over a region and noting the terrain, seeking convergence (a combination of geological elements in one location which hints at the presence of oil).
While it is tempting to start right off opening pages and looking for information, it is more effective to wait until you have scanned the brief descriptions most search engines provide for the hits. Scanning the top 100 hits provides a basis for revising the original search to accomplish two goals:
Think of the first search as a pot-luck supper with a 400 foot long table. Would you step up to the very first dish and start heaping food onto your plate? Or would you browse and graze before making choices?
Some of the best sites on the Internet are not indexed by the search engines. It pays to go to such Web sites and search them directly if they would be the leading source of information on a particular topic.
One outstanding example would be the huge collection of educational research documents housed at ERIC (New AskERIC Homepage). If you perform a search for "middle school guidance" on the free Internet, you are unlikely to find much worth reading (a mere 195 hits with Altavista), while ERIC provides hundreds of abstracts of research articles from several decades.
It turns out that because many public agencies and news media Web sites do not permit access (perhaps for security reasons) to the spiders of search engines, their contents often elude the search engine's efforts as well as our own search attempts.
If you know that the New York Times, the Wall Street Journal, the Seattle Times and the Los Angeles Times have all been giving careful attention to a current lawsuit such as the Justice Department suit against Microsoft, you may achieve far better results by starting your search at their Web sites rather than wasting time with a global search engine which would overlook their offerings.
After several years of looking for a good source of crime statistics using global search engines, I came to realize that I would have done better if I had started with the source . . . the Federal Bureau of Justice, a site which never showed up when I would use global search strategies. I also discovered, almost accidentally, that Money Magazine often provides better statistics on leading cities than most of the cities themselves because it publishes an annual rating of the top cities . . .
What is the best way to know what source or Web site to visit?
After you have conducted several browsing searches, you may begin to focus your search more sharply by adding key words to your search in order to limit hits to pages distinctly relevant to your inquiry.
Careful selection and addition of key words which are discriminating, distinguishing and distinctive, puts the spotlight on just those discrete pages which match your interests. Your key words differentiate, separate and reserve only the best pages.
How different would your results be with each of the following words?
The more you particularize your search, the better your results. Adding particulars and specifics excludes all pages which do not contain those items. The advantage is sharp focus. The danger is bypassing, missing or overlooking key data.
Sometimes it pays to alternate between narrowing and broadening. After zeroing in with some particulars, zoom back out and try some different particulars.
This may also be a good time to use wild cards, truncation, the logical operator "OR" and "exact phrase" syntax.
The most powerful strategy for culling your original findings is the use of the logical operator "NOT" - which may be AND NOT in some search engines (Altavista).
As you browse the first 100 hits, look for patterns and groupings of irrelevant pages. In order to exclude all such irrelevant pages from your collection, add AND NOT with a series of key words contained within parentheses and divided by OR).
If you search for Mayflower do you really want all the motels, hotels, insurance companies and moving companies? If not, try the following search . . .
Mayflower AND NOT (motels OR hotels OR insurance companies OR moving companies)
Power Searching is often more successful when you listen to your intuition and take chances. If you enjoy word play, you will have good luck with the trial-and-error searching process which leads to good results. Listen to your hunches. Be a Sherlock Holmes. Don't be so analytical and logical that you cannot make intuitive leaps. Digital Logic is the attempt to find order in an erratic, chaotic and disorganized information landscape.