Yahoo Search Wants to Be More Like Google, Embraces Hadoop
Yahoo is following in GoogleÃ¢â‚¬â„¢s footsteps again in search. Today, it is shifting a crucial part of its search engine to Hadoop, software that handles large-scale distributed computing tasks particularly well. Hadoop is an open-source implementation of GoogleÃ¢â‚¬â„¢s MapReduce software and file system. It takes all the links on the Web found by a search engineÃ¢â‚¬â„¢s crawlers and Ã¢â‚¬Å“reducesÃ¢â‚¬Â them to a map of the Web so that ranking algorithms can be run against them.
Yahoo is replacing its own software with Hadoop and running it on a Linux server cluster with 10,000 core processors.
Read more: techcrunch.com