GNQL V3 Query

GreyNoise Query Language
GNQL (GreyNoise Query Language) is a domain-specific query language
that uses Lucene deep under the hood. GNQL aims to enable GreyNoise
Enterprise and Research users to make complex and one-off queries
against the GreyNoise dataset as new business cases arise. GNQL is
built with self-defeat and fully featured product lines in mind. If
we do our job correctly, each individual GNQL query that brings our
users and customers sufficient value will eventually be transitioned
into it's own individual offering.
License: The business_service_intelligence response field
requires the BSI Module. Without it, every result returns an empty
business_service_intelligence object; all other fields are
returned normally.

Facets:

  • ip - The IP address of the scanning device IP
  • classification - Whether the device has been categorized as
    unknown, benign, or malicious
  • first_seen - The date the device was first observed by GreyNoise
  • last_seen - The date the device was most recently observed
    by GreyNoise
  • actor - The benign actor the device has been associated with,
    such as Shodan, Censys, GoogleBot, etc
  • tags - A list of the tags the device has been assigned over the
    past 90 days
  • spoofable - This IP address has been opportunistically scanning the
    Internet, however has failed to complete a full TCP connection. Any
    reported activity could be spoofed.
  • vpn - This IP is associated with a VPN service. Activity, malicious
    or otherwise, should not be attributed to the VPN service provider.
  • vpn_service - The VPN service the IP is associated with
  • tor - Whether or not the device is a known Tor exit node
  • cve - A list of CVEs that the device has been associated with
  • single_destination - A boolean parameter that filters source country
    IPs that have only been observed in a single destination country
  • metadata.category - Whether the device belongs to a business, isp,
    hosting, education, or mobile network
  • metadata.carrier - The Internet Service Provider (ISP) or telecommunications carrier
    associated with the source IP address
  • metadata.country - The full name of the country the device is
    geographically located in (This is the same data as
    metadata.source_country. metadata.source_country is preferred)
  • metadata.country_code - The two-character country code of the
    country the device is geographically located in (This is the same data
    as metadata.source_country_code. metadata.source_country_code
    is preferred)
  • metadata.datacenter - The datacenter or hosting provider from which the activity originates.
    This could indicate the use of cloud services, managed hosting,
    or enterprise datacenter infrastructure.
  • metadata.domain - The domain name associated with the source IP address
  • metadata.sensor_hits - The amount of unique data that has been recorded by the sensor
  • metadata.sensor_count - The number of sensors the IP Address has been observed on
  • metadata.city - The city the device is geographically located in
  • metadata.region - The region the device is geographically located in
  • metadata.organization - The organization that owns the network that
    the IP address belongs to
  • metadata.rdns - The reverse DNS pointer of the IP
  • metadata.asn - The autonomous system the IP address belongs to
  • metadata.destination_cities - The city where the GreyNoise sensor is geographically located
  • metadata.destination_asns - The ASN associated with the destination IP address
  • metadata.destination_countries - The full country name where the GreyNoise
    sensors are physically located
  • metadata.destination_country_codes - The country code where the GreyNoise
    sensors are physically located
  • metadata.destination_country - The full country name where the GreyNoise
    sensors are physically located
  • metadata.destination_country_code - The country code where the GreyNoise
    sensors are physically located
  • metadata.latitude - The geographic latitude of the source IP address
  • metadata.longitude - The geographic longitude of the source IP address
  • metadata.rdns_parent - The parent domain retrieved through reverse DNS (RDNS)
    lookup of the source IP address
  • metadata.rdns_validated - A validation status that confirms whether the
    reverse DNS (RDNS) record correctly maps to the source domain
  • metadata.source_country_code - The two-character country code of the
    country the device is geographically located in
  • metadata.source_country - The full name of the country the device is
    geographically located in
  • raw_data.scan.port - The port being targeted on a GreyNoise sensor
  • raw_data.scan.protocol - The protocol of the port the device has
    been observed scanning
  • raw_data.web.paths - Any HTTP paths the device has been observed
    crawling the Internet for
  • raw_data.web.useragents - Any HTTP user-agents the device has been
    observed using while crawling the Internet
  • raw_data.ja3.fingerprint - The JA3 TLS/SSL fingerprint
  • raw_data.ja3.port - The corresponding TCP port for the given JA3
    fingerprint
  • raw_data.hassh.fingerprint - The HASSH fingerprint
  • raw_data.hassh.port - The corresponding TCP port for the given HASSH
    fingerprint
  • raw_data.http.md5 - An MD5 hash of the body content. This compact,
    unique representation of the data allows for quick comparisons and
    deduplication of payloads without storing the raw content.
  • raw_data.http.cookie_keys - The keys or names of cookies exchanged in the
    communication. These can reveal session identifiers, tracking mechanisms,
    or other metadata used in web interactions,
    providing clues about application behavior or vulnerabilities.
  • raw_data.http.request_authorization - The contents of the Authorization header in a request,
    typically containing authentication credentials or tokens (e.g., Basic Auth, Bearer tokens).
    Analyzing this helps verify authorization mechanisms and detect credential misuse or token abuse.
  • raw_data.http.request_cookie - Key-value pairs stored in cookies sent with an HTTP request.
    These cookies often contain session identifiers, user preferences, or tracking data,
    which can be analyzed to detect unauthorized access or manipulation.
  • raw_data.http.request_header - Request Headers are the keys (names) of HTTP headers that a
    client sends to a server.
  • raw_data.http.request_method - The HTTP method used in the request, such as GET, POST, PUT, or DELETE.
    Analyzing methods can reveal the intent of the request, such as retrieving or modifying resources,
    and identify unexpected or suspicious activity.
  • raw_data.http.request_origin - Indicates the origin of the request, typically used in
    cross-origin resource sharing (CORS) to specify where the request originated.
    This helps identify unauthorized or potentially malicious cross-origin requests.
  • raw_data.tls.cipher - The encryption algorithm or cipher suite used during
    the secure communication. Identifying the cipher helps assess the
    security of the connection, particularly in TLS/SSL traffic.
  • raw_data.tls.ja4 - JA4 TLS fingerprint. JA4 captures distinctive
    characteristics of TLS client behavior, useful for identifying and
    clustering malicious or anomalous clients.
  • raw_data.http.ja4h - JA4H HTTP client fingerprint. Captures
    characteristics of HTTP client behavior including method, headers,
    and cookie fields, useful for identifying and tracking HTTP clients.
  • raw_data.ssh.ja4ssh - JA4SSH fingerprint. Captures SSH traffic
    patterns including packet lengths and directions, useful for
    identifying SSH client behavior and detecting anomalous sessions.
  • raw_data.tcp.ja4t - JA4T TCP fingerprint. Captures TCP
    connection characteristics such as window size, options, and MSS,
    useful for OS fingerprinting and identifying network stacks.
  • raw_data.tcp.ja4l - JA4L light distance/latency fingerprint.
    Captures TCP TTL and window size characteristics, useful for
    estimating client-server distance and identifying proxied connections.
    Behavior:
  • raw_data.ssh.key - This is the SSH key used.
  • You can subtract facets by prefacing the query with a minus character
  • The data that this endpoint queries refreshes once per hour
    Shortcuts:
  • You can find interesting hosts by using the GNQL query term
    interesting
  • You can use the keyword today in the first_seen and
    last_seen parameters: last_seen:today or first_seen:today
    Examples:
  • last_seen:today - Returns all IPs scanning/crawling the
    Internet today
  • tags:Mirai - Returns all devices with the "Mirai" tag
  • tags:"RDP Scanner" - Returns all devices with the "RDP
    Scanner" tag
  • classification:malicious metadata.country:Belgium
  • Returns all compromised devices located in Belgium
  • classification:malicious metadata.rdns:*.gov* - Returns
    all compromised devices that include .gov in their reverse DNS records
  • metadata.organization:Microsoft classification:malicious
  • Returns all compromised devices that belong to Microsoft
  • (raw_data.scan.port:445 and raw_data.scan.protocol:TCP) metadata.os:Windows* - Return all devices scanning the Internet
    for port 445/TCP running Windows operating systems
    (Conficker/EternalBlue/WannaCry)
  • raw_data.scan.port:554 - Returns all devices scanning the
    Internet for port 554
  • -metadata.organization:Google raw_data.web.useragents:GoogleBot
  • Returns all devices crawling the Internet with "GoogleBot" in
    their useragent from a network that does NOT belong to Google
  • tags:"Siemens PLC Scanner" -classification:benign - Returns
    all devices scanning the Internet for SCADA devices who ARE
    NOT tagged by GreyNoise as "benign"
    (Shodan/Project Sonar/Censys/Google/Bing/etc)
  • classification:benign - Returns all "good guys" scanning
    the Internet
  • raw_data.ja3.fingerprint:795bc7ce13f60d61e9ac03611dd36d90
  • Returns all devices crawling the Internet with a matching
    client JA3 TLS/SSL fingerprint
  • raw_data.hassh.fingerprint:51cba57125523ce4b9db67714a90bf6e
  • Returns all devices crawling the Internet with a matching
    client HASSH fingerprint
  • raw_data.tls.ja4:t13d1516h2_8daaf6152771_02713d6af862
  • Returns all devices with a matching JA4 TLS fingerprint
  • raw_data.http.ja4h:ge11cn060000_4e59edc1297a_4da5efaf0cbd
  • Returns all devices with a matching JA4H HTTP fingerprint
  • raw_data.ssh.ja4ssh:c76s76_c71s59_c0s0
  • Returns all devices with a matching JA4SSH fingerprint
  • raw_data.tcp.ja4t:64240_2-1-3-1-1-4_1460_8
  • Returns all devices with a matching JA4T TCP fingerprint
  • raw_data.tcp.ja4l:1460_64
  • Returns all devices with a matching JA4L light
    distance/latency fingerprint
  • raw_data.web.paths:"/HNAP1/" -Returns all devices crawling
    the Internet for the HTTP path "/HNAP1/"
  • 8.0.0.0/8 - Returns all devices scanning the Internet from
    the CIDR block 8.0.0.0/8
  • cve:CVE-2021-30461 - Returns all devices associated with the
    supplied CVE
  • source_country:Iran - Returns all results originating from Iran
  • destination_country:Ukraine single_destination:true
  • Returns all results scanning in only Ukraine
Query Params
string
required

GNQL query string

integer
1 to 10000
Defaults to 10000

The number of results provided per page for paginating through all results of a query

string

Scroll token to paginate through results. Incompatible with format=csv.

boolean
Defaults to false

If true, the response will only include the IP address and the classification or trust level.

string
enum
Defaults to json

Specifies the desired format of the results. Must be either csv or json.

Allowed:
string

Comma-separated list of fields to exclude from the response.
Recognized top-level response fields (e.g. tags, cves, vpn, tor, raw_data, metadata),
metadata.<subfield> paths (e.g. metadata.organization, metadata.source_country,
metadata.destination_countries), and raw_data.<subfield> paths (e.g. raw_data.ja3,
raw_data.http.useragent) are accepted. The special value tags.details preserves tag
identity (id, slug) and strips only the enriched details. Unknown field names return 400.

Responses

Language
Credentials
Header
URL
LoadingLoading…
Response
Click Try It! to start a request and see the response here! Or choose an example:
application/json