OSINT - Open Source Intelligence

What is OSINT?

OSINT (Open Source Intelligence) is the systematic gathering and analysis of information from publicly accessible sources. The term comes from the intelligence world, where analysts used it to distinguish secret from public sources. In cybersecurity, OSINT is the first phase of both an attack and a pentest. The goal is to find out what is publicly available about a target, without sending a single packet toward their infrastructure. MITRE ATT&CK categorizes this under the tactic Reconnaissance (TA0043).

OSINT is not hacking. Nothing is abused or bypassed. Everything OSINT yields is already on the internet for whoever knows where to look.

Which sources does OSINT use?

The value of OSINT lies in combining different sources into one coherent picture.

DNS, WHOIS and certificate transparency logs reveal subdomains, mail servers and historical records. Search engines with targeted dorks such as site:example.com filetype:pdf surface documents that are not findable through a website’s navigation. LinkedIn often gives a near-complete picture of the org chart, the technologies in use and recent staff changes.

Code repositories such as GitHub and GitLab regularly contain leftover API keys, database strings or internal configurations. PDFs and Office documents contain metadata with author names, software versions and file paths. Internet scanners such as Shodan and Censys show internet-facing services and certificates. Breach databases such as HaveIBeenPwned show which email addresses of your employees once turned up in a breach. The Wayback Machine and company registers (such as the Dutch Chamber of Commerce) provide historical context. Think of a page you had removed but that is still viewable, or a director who left a while ago.

OSINT in a pentest

In a corporate network pentest, OSINT usually starts with scope verification. Which domains and IP ranges are really yours, and which of those are publicly reachable? That almost always surfaces at least one subdomain or host that nobody had on their radar anymore.

For a phishing simulation, OSINT is the basis for the target list. We combine email addresses we find in public sources with names and roles from LinkedIn, so the phishing attack fits believably into the context of your organization. With spear phishing we go deeper. Personal contexts, recent projects and public statements by the target then go into the phishing scenario the attacker sets up.

During every OSINT round we watch specifically for sensitive information in code repositories or document metadata. It regularly happens that a developer makes a quick test commit with a real API key in it and never reverts it.

What can an attacker do with OSINT?

OSINT on its own does no damage. The information found forms the basis for what comes after.

With a list of email addresses and names, phishing becomes effective. An email from “HR” with the real name of your HR manager gets further than a generic phishing blast. A leaked API key from a GitHub commit gives direct access to your cloud environment, without an attacker having to guess a single password. A forgotten subdomain that points to an expired cloud resource is a classic subdomain takeover, letting an attacker place content under your own domain name. A software version listed in document metadata reveals which vulnerabilities he can try in a targeted way.

In practice OSINT is rarely the hardest part of an attack. It is the part where an attacker decides whether it is worth going further.

How do you reduce your OSINT footprint?

You cannot make your organization completely invisible, and that is not the goal either. You can take away the easy wins for an attacker:

Strip metadata from documents before publication. PDF converters and Office settings usually offer an option to remove personal data and file paths.
Run secret scanning on your code repositories. Tools such as gitleaks and trufflehog find hardcoded API keys, including in old commits. GitHub and GitLab offer similar native scanning.
Clean up DNS records. Wildcard records, old staging subdomains and forgotten cloud resources are the first finds in every OSINT round.
Monitor breach databases for your own domain. HaveIBeenPwned offers a free email alert for domains, so you know when credentials have leaked and can force those passwords to be reset.
Discuss with your employees which information they share on LinkedIn. A detailed description of your internal IT stack does not belong there.
Run an external footprint audit periodically. That can be a standalone job or part of a pentest where it has been agreed.

None of these measures makes OSINT impossible. They do raise the effort an attacker has to put in before finding anything usable.

WEB

MOBILE

INFRA

PHYSICAL

CLOUD

REVIEW

COMPLIANCE

WE ARE |

WE ARE

HACKIFY

OSINT - Open Source Intelligence

What is OSINT?

Which sources does OSINT use?

OSINT in a pentest

What can an attacker do with OSINT?

How do you reduce your OSINT footprint?

Frequently asked questions about OSINT

Is OSINT legal?

What is the difference between passive and active OSINT?

How long does an OSINT round take in a pentest?

How do I know what is publicly findable about my organization?

Related articles

Nmap - Network Mapper

LSASS - Local Security Authority Subsystem Service

Kerberoasting