Many aren’t aware of the sheer power that the Google offers users when it comes to information gathering. Google has literally hundreds of thousands of servers that actively collect information about the Internet.
They have programs called web crawlers that do just that, crawl the web reporting back to Google what they have found. You would be hard pressed to find a site that doesn’t have a google bot on it at any given time, or a site running the google-analytics script in it’s source code reporting things back as well.
So what kinds secrets does Google hold? They have after all been indexing every movement on the web for over 10 years.
Well, they hold alot. Let’s go over some of the techniques for pulling data out the google that would be otherwise hard to find.
Let’s take a look at these obscure operands pulled from the web:
Category: Files Containing Passwords
intitle:”Index of” .mysql_history
The .mysql_history file contains commands that were performed against a mysql database. A “history” of said commands. First, you shouldn’t show this file to anyone, especially not a MAJOR SEARCH ENGINE! Secondly, I sure hope you wouldn’t type anything sensitive while interacting with your databases, like oh say USERNAMES AND PASSWORDS…
filetype:xls username password email
This search shows Microsoft Excel spreadsheets containing the words username, password and email. Beware that there are a ton of blank “template” forms to weed through, but you can tell from the Google summary that some of these are winners… err losers.. depending on your perspective.
inurl:passlist.txt
Cleartext passwords. No decryption required!
Category: Various Online Devices (Very Fun)inurl:indexFrame.shtml Axis
The AXIS 2400 is a Web server of its own. This means that the server is secured like any other Internet host. It is up to the network manager to restrict access to the AXIS Web Cameras camera server. AXIS Network cams have a cam control page called indexFrame.shtml which can easily be found by searching Google. An attacker can look for the ADMIN button and try the default passwords found in the documentation. An attacker may also find that the directories are brow-sable. Additional security related information was found on the Internet.Securityfocus(www.securityfocus.com):—————————————————-”It has been reported that the Axis Video Servers do not properly handle input to the ‘command.cgi’ script. Because of this, an attacker may be able to create arbitrary files that would result in a denial of service, or potentially command execution.” Core Security Technologies Advisory (http://www.coresecurity.com):—————————————————”We have discovered the following security vulnerability: by accessing http://camera-ip//admin/admin.shtml (notice the double slash) the authentication for “admin” is bypassed and an attacker gains direct access to the configuration.
All in all there is really one place onlin that has been the mecca for Google “Hacking” The Google Hacking database for Jonny I hack stuff.
http://johnny.ihackstuff.com/ghdb/ There is a collection of operands like you never seen before.
It doesn’t end there though. Some people have discovered their own operands that can pull amazing things from the web and will never make them public. You should have the basic idea of how they can go from reading this blog and visiting that site.
experiment with these operators to see what you can mine from google:
allinanchor:, allintext:, allintitle:, allinurl:, cache:, define:, filetype:, id:, inanchor:, info:, intext:, intitle:, inurl:, phonebook:, related:, site:
After the semi colon you would simply add what your looking for. ie. allintitle:cellphone secrets
This will mine various postings with content related to that and related to exactly that. You can pull things you are not supposed to see like this also.
why this works?
It works because googlw has access parts of web sites that regular users do not have. when the Google bots see content, they send it back to their servers for indexing. Thus making it searchale if you figure out what to type. Things like email addresses,passwords, hidden logins and sensitive directories are not exempt. Try these teqniques on yourself and see what info Google holds on you. Many forums I belong to will show my post instantly on Google as soon as I serach for my forum name….
happy hunting. Cheers!