http://www.shodanhq.com/ has been added to the list for Net Sec deep web assets. Should have added it 2 years ago, my bad.
Archive for deep web resources
We’ve added Getty, they have part of their collection online.
Getty Research Institute – http://www.getty.edu/research/library/ – The Getty Research Institute library collections include over one million books, periodicals, study photographs, and auction catalogs as well as extensive special collections of rare and unique materials. Focusing on art history, architecture, and related fields, they begin with the archaeology of prehistory and extend to the contemporary moment.
Getting sources of Data is always a problem when tackling a statistical or data mining project. Here are two very nice deep web assets:
http://databank.worldbank.org/data/home.aspx – Speciality statistical data on all kinds of subjects, from countries GDP to levels of blindness.
http://www.quandl.com/ – An awesome collection of 9,000,000 of financial, economic, and social datasets.
Was shown a nice search engine for embedded, zipped, or otherwise obscured files.
Looks in FTP, zip, RAR and other formats.
Also added the National Security Archive
While researching on the current massive meteorite strike in Russia this morning, one deep web resource has shown how useful it is for extracting out videos for local OSInt.
It is being added to the deep web list
Ok, so maybe I missed a few lectures in college. Perhaps quite a few. But via deep web research, I’ve found an excellent deep web science lecture resource for high authority, peer reviewed, lectures covering science and technology. Known at videolectures.net and provided free of charge to the public. The library contains 15,000 lectures of which 66% are in English. Provided to us for free courtesy of the Center for Knowledge Transfer subsidized by Slovenia. Hats off to those guys.
Discovered and Tested out some new engines. The main catch of the day is the Bielefeld BASE search engine. Very nice, very deep. Behind that I’ve stumbled onto the CiteSeerx for computational science. Useful for a infomation academic searches, like searching out for ‘string matching’ such as Jaro Winkler and Levenstein distance metrics.
Bielefeld Academic Search Engine – www.base-search.net – BASE is one of the world’s most voluminous search engines especially for academic open access web resources. BASE is operated by Bielefeld University Library. Pulls from over 2,200 sources, mainly Universities. (see source list )
CiteSeerx – http://citeseer.ist.psu.edu/index – Mainly focus on articles for computer and informational science
Refseek – http://www.refseek.com – Currently in Beta, it attempts to search academic, open source, newspaper, and web content for academic content. No where near as comprehensive or accurate in locating documentation as the Base search hosted by Bielefeld, but it has promise.