My Thoughts & Views

Archive for July 2nd, 2007

2 July 1566: Death of Nostradamus, French physician and astrologer who made predictions of the future in his Centuries.

As you all know, Google is attempting to digitize all available printed material. Google uses OCR (optical character recognition) to make the scanned material readable by search engine crawlers. This means that a scanned page doesn’t appear to the search engine as just an image. The words can be deciphered, so keywords etc. can be used in the same way as they would be for web documents.

How can OCR help common people?
Suppose you have written all over a whiteboard, then realised that it is good stuff and you want to keep it all, but it’s too much to write out. Did you get your camera or mobile out and take a photo? This is where OCR can really be efficient: you could load the photo into your computer and it would still be able to read the text (provided, of course, that handwriting and pattern recognition are also included).

We are all aware of Google’s PageRank-based search algorithm for producing search results.

But I think there are certain limitations to this type of algorithm. The following are the limitations I see:
1. New pages have a low PageRank, and they take a long time to get listed and gain high ranks.
2. If someone quotes something inaccurately on a web page and subsequent readers then repeat it on other web pages, search engines index all of the inaccurate pages, and we end up with a mess where fiction is accepted as reality.
3. Search results are based on literal things (keywords, tags, metadata) but not on meaning.
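
To make the first limitation concrete, here is a minimal sketch of the published PageRank idea (power iteration with a damping factor). It is not Google's actual production algorithm, and the toy web, the page names and the parameter values are all assumptions made up for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Simplified PageRank. `links` maps each page to the pages it links to."""
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}  # start with equal rank everywhere
    for _ in range(iterations):
        # Every page gets a small baseline share, whether or not anyone links to it.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page passes its rank on, split evenly across its outgoing links.
                share = damping * rank[page] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # A page with no outgoing links spreads its rank over all pages.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


# Hypothetical toy web: "a", "b" and "c" are established pages that link to
# each other; "new" links out, but nobody links to it yet.
toy_web = {
    "a": ["b", "c"],
    "b": ["a", "c"],
    "c": ["a"],
    "new": ["a"],
}

ranks = pagerank(toy_web)
for page, score in sorted(ranks.items(), key=lambda item: item[1], reverse=True):
    print(page, round(score, 4))
```

In this toy web the page "new" ends up at the bottom, because it only ever receives the baseline share. Its rank can rise only once other pages start linking to it, which is why new pages take time to gain high ranks.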

I think Google’s move to provide custom search engines may help to reduce the limitations mentioned above. Or somebody else may take care of these limitations, and they may come to dominate the Google-dominated world.

Let me know your views on the limitations of PageRank and Google.

Let us hope that in the future we can get very accurate results thanks to the present research in these areas.

Cross-Language Information Retrieval (CLIR): This is the technique used to retrieve information stored in different languages. We are all aware of Google’s translation services; Google is now working on CLIR to translate search queries into other languages and translate the results back again.
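
The round trip described above (translate the query, search in the other language, translate the results back) can be sketched roughly as follows. This is only a toy illustration, not how Google implements CLIR: the word-for-word dictionaries, the French documents and the function names are all hypothetical, and a real system would use a proper translation service and search index.

```python
# Hypothetical word-for-word dictionaries, invented for this example.
EN_TO_FR = {"cat": "chat", "black": "noir", "house": "maison"}
FR_TO_EN = {fr: en for en, fr in EN_TO_FR.items()}

# Hypothetical document collection stored in French.
FRENCH_DOCS = ["le chat noir", "la maison blanche", "un chien noir"]


def translate(text, dictionary):
    """Translate word by word; words missing from the dictionary are kept as-is."""
    return " ".join(dictionary.get(word, word) for word in text.lower().split())


def cross_language_search(query):
    """Search the French documents with an English query and return English results."""
    # Step 1: translate the query into the language of the documents.
    terms = set(translate(query, EN_TO_FR).split())
    # Step 2: keep every document that shares at least one term with the query.
    hits = [doc for doc in FRENCH_DOCS if terms & set(doc.split())]
    # Step 3: translate the matching documents back for the searcher.
    return [translate(doc, FR_TO_EN) for doc in hits]


results = cross_language_search("black cat")
print(results)
```

The translated results come out rough because the toy dictionary only knows three words, but the flow is the point: the searcher never needs to read or write the language the documents are stored in.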

How does it help?
It helps to break the language barrier by making it possible to find anything in any language. It also helps to digitize old manuscripts and helps people find information from old books written in forgotten languages.

We all know about the search engine giant Google. Recently I came across a site called searchmash.com, which is operated by Google.

Is Google researching a secret search engine? Take a look at Searchmash.

Why another search engine? Is Google experimenting with new ways of searching the web? Why is it not listed in Google Labs?

If you find any information, let me know too.

