Space Industry and Business News  
Researchers Use Wikipedia To Make Computers Smarter

Made for Wikipedia.
by Staff Writers
Tel-Aviv, Israel (SPX) Jan 10, 2007
Researchers at the Technion-Israel Institute of Technology have found a way to give computers encyclopedic knowledge of the world to help them "think smarter," making common sense and broad-based connections between topics just as the human mind does.

The new method will help computers filter e-mail spam, perform Web searches and even conduct electronic intelligence gathering at a much more sophisticated level than current programs, according to researchers Evgeniy Gabrilovich and Shaul Markovitch of the Technion Faculty of Computer Science. The findings will be presented next week in Hyderabad, India, during the Twentieth International Joint Conference on Artificial Intelligence.

The program devised by the Technion researchers helps computers map single words and larger fragments of text to a database of concepts built from the online encyclopedia Wikipedia, which has over one million articles in its English-language version. The Wikipedia-based concepts act as "background knowledge" to help computers figure out the meaning of the text entered into a Web search, for instance.
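As a rough illustration of mapping text to encyclopedia-derived concepts, the sketch below builds a tiny inverted index from three made-up "articles" and ranks concepts for a piece of text. The article texts, the TF-IDF weighting and the function names (`build_index`, `concept_vector`, `top_concepts`) are assumptions for illustration only, not the researchers' actual program or data.

```python
import math
from collections import Counter, defaultdict

# Hypothetical stand-in for Wikipedia: concept title -> article text.
ARTICLES = {
    "Computer mouse": "a mouse is a pointing device used with a computer screen and keyboard",
    "House mouse": "the mouse is a small rodent animal with fur and a long tail",
    "Keyboard": "a keyboard is an input device with keys used to type on a computer",
}

def tokenize(text):
    return text.lower().split()

def build_index(articles):
    """Return an inverted index: word -> {concept: TF-IDF weight}."""
    n = len(articles)
    counts = {title: Counter(tokenize(body)) for title, body in articles.items()}
    doc_freq = Counter(word for c in counts.values() for word in c)
    index = defaultdict(dict)
    for title, c in counts.items():
        for word, tf in c.items():
            idf = math.log(n / doc_freq[word])
            if idf > 0:  # words found in every article carry no signal
                index[word][title] = tf * idf
    return index

def concept_vector(text, index):
    """Sum the concept weights of every word in the text."""
    vec = defaultdict(float)
    for word in tokenize(text):
        for concept, weight in index.get(word, {}).items():
            vec[concept] += weight
    return dict(vec)

def top_concepts(text, index, k=2):
    """Return up to k concept titles, strongest first."""
    vec = concept_vector(text, index)
    return sorted(vec, key=vec.get, reverse=True)[:k]

INDEX = build_index(ARTICLES)
print(top_concepts("rodent with a long tail", INDEX))
print(top_concepts("pointing device on a screen", INDEX))
```

With this toy data the first query ranks "House mouse" highest and the second ranks "Computer mouse" highest. A real system would need a far larger article collection and more careful text processing.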

Giving computers this deeper knowledge has been a long-standing problem in artificial intelligence, according to Markovitch. "Humans use a significant amount of background knowledge" to understand text, "but we didn't know how to have computers access such knowledge," he said.

Most Web search and e-mail filter programs appear smart by calculating how often certain words appear in two texts, Markovitch explained. "But what is common to all these applications is that the programs that actually do this kind of thing don't understand text. They treat text as a collection of words, but they don't understand the meaning of words."
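The word-counting approach Markovitch describes can be sketched as a bag-of-words comparison. The example messages and the choice of cosine similarity are assumptions for illustration. The point is only that two texts with no words in common score zero, however related their meanings are.

```python
import math
from collections import Counter

def bag_of_words(text):
    """Treat a text as nothing more than a multiset of its words."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

# Hypothetical messages, made up for this example.
known_spam = bag_of_words("buy cheap vitamin pills today")
same_word = bag_of_words("cheap vitamin offer")
new_word = bag_of_words("discount b12 supplements")

print(cosine(known_spam, same_word))  # greater than zero: shared words
print(cosine(known_spam, new_word))   # 0.0: no shared words at all
```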

This shallow understanding is what makes an e-mail spam filter block all messages containing the word "vitamin," but fail to block messages containing the word "B12." "If the program never saw 'B12' before, it's just a word without any meaning. But you would know it's a vitamin," Markovitch said.

"With our methodology, however, the computer will use its Wikipedia-based knowledge base to infer that 'B12' is strongly associated with the concept of vitamins, and will correctly identify the message as spam," he added.

Or, computers could look at a chunk of text about Saddam Hussein and weapons of mass destruction and know that it is conceptually related to topics such as the Iraq war and U.S. Senate debates on intelligence, even if those terms do not appear anywhere in the original text.
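The kind of inference described above, relating "B12" to vitamins or a text to topics it never names, can be sketched by comparing texts in concept space rather than word space. The three-article knowledge base, the raw-count scoring and the function names here are hypothetical. The sketch has no stop-word handling, so it is only meaningful for short, content-bearing inputs.

```python
import math
from collections import Counter

# Hypothetical miniature knowledge base: concept title -> article text.
KNOWLEDGE = {
    "Vitamin": "a vitamin is a nutrient such as vitamin c or vitamin b12 needed in small amounts",
    "Vitamin B12": "b12 is a vitamin involved in metabolism and found in supplements",
    "Mortgage": "a mortgage is a loan used to buy a house with interest",
}

def concept_vector(text):
    """Score each concept by how often the text's words occur in its article."""
    words = text.lower().split()
    vec = {}
    for concept, body in KNOWLEDGE.items():
        counts = Counter(body.split())
        score = sum(counts[w] for w in words)
        if score:
            vec[concept] = score
    return vec

def relatedness(a, b):
    """Cosine similarity between the concept vectors of two texts."""
    va, vb = concept_vector(a), concept_vector(b)
    dot = sum(va[c] * vb.get(c, 0) for c in va)
    norm = (math.sqrt(sum(v * v for v in va.values()))
            * math.sqrt(sum(v * v for v in vb.values())))
    return dot / norm if norm else 0.0

print(relatedness("b12", "vitamin"))   # high: both activate the vitamin concepts
print(relatedness("b12", "mortgage"))  # 0.0: no concept in common
```

The words "b12" and "vitamin" are different strings, yet they score as related because both activate the same articles in the toy knowledge base.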

The method also helps computers figure out ambiguous terms, deciding, for instance, whether the word "mouse" refers to the computer device or the fuzzy animal. This can be especially important in translated documents, Markovitch said.

In the near future, the Technion researchers hope to improve their method by adding information from the Web page links inside Wikipedia articles. They are already pursuing a patent on their work, which they say will be of interest to the intelligence community and Web search engine companies, among others.
