• Home
  • /Archive by category ' Web & Technology '
  • /Page 17

Archive For: Web & Technology

Common Crawl: The Open Search Engine

Our mission is to democratize access to web information by producing and maintaining an open repository of web crawl data that is universally accessible. We store the crawl data on Amazon’s S3 service, allowing it to be bulk downloaded as well as directly accessed for map-reduce processing in EC2.…

Read more →

 

Applying Random Surfer Model to Peer-to-Peer Network Distribution

Author: Dan Petrovic, http://dejanseo.com.au

Digital information preservation is a hot topic and a fertile ground for many bubbling solutions and models in both practice and theory. One of the emerging issues revolves around the fact that there is more information being produced today than we’re able to store and analyse, not to mention attempts at prioritisation and archiving.…

Read more →

 

Google Analytics: Removing Search Query Data

Yesterday Google released an article on the official Google Analytics blog [1] outlining a new change to the reporting of organic traffic which was rolled out at the same time as the article. Basically, Google claims that they are now trying harder to “protect” their users by no longer reporting the organic query terms of anyone who is logged into their google.com account.…

Read more →