Create query
To use Snyk Code custom rules to create queries with suggestive AI support, you can choose from the provided templates and predicates. Alternatively, you can create your own predicates and save them as a custom rule.
Consider the following query examples and rules to use with Snyk Code custom rules. A CWE 312 query example is provided on this page.
Simple syntactical query
Copy the following source code snippet in the snippet window and select C# as the language
It is only a snippet and not a full program. It will not compile.
Running the query
Enter the following queries in the query window and press Run Query to see the results.
Select
body
by using the query:“body”
This query does not select the Body with a capital B. The query language is case-sensitive.
Add
Body
to the findings so the query becomesOr<”body”,”Body”>
.You can achieve the same outcome using a regex
~"body|Body"
or~"[Bb]ody"
Do something more complex regex and query:
~"[a-z0-9!#$%&'*+/=?^_{|}~-]+(?:\.[a-z0-9!#$%&'*+/=?^_`{|}~-]+)*@(?:[a-z0-9](?:[a-z0-9-]*[a-z0-9])?.)+[a-z0-9](?:[a-z0-9-]*[a-z0-9])?"
It matches the hardcoded email address.
Try it yourself
Run the following query over your code ~"([a-zA-Z0-9+/]{40})"
If you find something, check it out first, as you might leak your AWS secrets.
If you are interested in a certain type of object, you can use templates. For example, the query CallExpression<"Format">
matches a function call or Literal<"nobody@notrealdomain.co.uk">
matches the string with the email address.
A data flow or taint analysis
For this example, a JavaScript code snippet is used. You can copy it in the snippet window and select JavaScript.
Snyk Code knows a list of possible sources of external data in the predicate PRED:AnySource
. The following query shows you that app.post()
is identified.
Query PRED:SqliSinks
shows you that query()
is part of that list of SQL injection sinks. The query engine comes with many different predicates for various source, sink, and sanitizer types. Check the list of predicates to see them all.
To check whether the data flows into a SQL injection sink, use the following: DataFlowsInto<PRED:SqliSink>
. It shows you that in the program, data from the req
parameter flow into query()
taking several turns.
If the data flow is also going through a sanitizer, you can use a specialized template. Change the query to Taint<PRED:AnySource, PRED:SqliSanitizer, PRED:SqliSink>
There is nothing language-specific in the query. It would work on similar code in other languages.
Net new data flow rule
Create a new rule because Snyk is not aware of the proprietary source built in-house, resulting in missed findings.
Use a data flow template known as Taint
when creating a data flow query.
You can configure the following parameters:
Source: The first parameter indicates where the data flow starts.
Sanitizer: The second parameter indicates a known sanitizer that would sanitize the data, resulting in the data's not being tainted
Sink: The third parameter indicating where the data flow ends
Custom predicates are indicated by writing their names within brackets. In this scenario, the custom method is called SourceFoo
.
With this query, you can look for the data flow that originates in SourceFoo
. A source unknown to Snyk ends up in a known vulnerable cross-site scripting (XSS) Sink and does not pass through a known cross-site scripting (XSS) Sanitizer. Therefore, the assumption is that the data is tainted.
Extend a data flow rule
Recreate a Snyk rule and add a source to the current Snyk known vulnerable source list because they are not being taken into account in the scans, resulting in missed vulnerabilities.
Like the Net new data flow rule, the Taint
data flow template is used with an Or
operator. Operators are available to create logical statements for your queries, such as Or
or And
.
Run the data flow rule using both the Snyk known sources but also a custom source called .
With this query, you look for the data flow that originates in a Snyk known source OR “SourceFoo
” . A source unknown to Snyk ends up in a known vulnerable cross-site scripting (XSS) Sink and does not pass through a known cross-site scripting (XSS) Sanitizer. Therefore, the assumption is that the data is tainted.
Any statement that uses an operator will be written within angle brackets < statement >
.
Context added to data flow rule
Recreate a Snyk rule and remove a source from the current Snyk known vulnerable sources because this source is not vulnerable within the context of an application.
Like the Net new data flow and Extend a data flow rules, the Taint
data flow template is used with an And
operator. A declarative negative statement (Not
) is used to indicate the false case of the statement and not the true case.
Run the data flow rule using the Snyk known sources, removing SnykSource
from the results. In this example, SnykSource
is a Snyk-known source that is used within the regular general AnySource
predicate.
With this query, you look for the data flow that originates in a known Snyk source but remove results that come from SnykSource
that end up in a known vulnerable cross-site scripting (XSS) Sink and do not pass through a known cross-site scripting (XSS) Sanitizer. Therefore, the assumption is that the data is tainted.
High recall mode
See all the sources and sinks in the source code to understand every location where data can flow from or to. This analysis is conducted regardless of the presence of data flows, allowing users to assess coverage comprehensively.
This mode is often used as an investigatory tool to gain deeper insights into the code stack and to comprehend where data originates and terminates within the application.
To initiate this analysis, use the AnySource
or AnySink
predicates, which encompass all Snyk-known sources and sinks, respectively.
This query identifies and highlights every known source within the code being analyzed.
Similarly, this query identifies and highlights every known sink within the code, providing a complete view of potential data endpoints.
Assessing coverage and identifying missing data flow links
Identifying gaps in data flow paths is crucial for understanding the destinations of user data, especially if it does not end up in expected locations such as databases or file systems. These gaps may reveal the use of unsupported libraries or frameworks—or components thereof—potentially leading to false negatives. This insight is essential for comprehensive security assessments and ensuring robust coverage.
Out of the box, Snyk will not show any vulnerabilities for these two classes, so the following query will not yield any results in the code snippet above.
Find unmatched sources
The following query enhances security coverage by revealing unmatched sources, pinpointing situations where the HttpServletRequest
parameter in theWebServlet
's doPost
method is not linked to known sinks, thus identifying gaps in data handling.
Find unmatched sinks
Similarly, to improve coverage, another query locates unmatched sinks by finding elements like the java.sql.Connection
object in the WebServer
class that are poised to receive data but lack incoming data flows, highlighting areas for potential gaps in library and framework coverage.
CWE 312 query example
In the context of Static Application Security Testing (SAST), identifying vulnerabilities associated with CWE-312 poses complex challenges in terms of analysis and accuracy. Specifically, the following concerns must be addressed:
Data Sensitivity Classification: Accurately categorizing which data elements are sensitive and which are not is a non-trivial task. An incorrect classification can result in false positives.
Sink Protection Validation: Another challenge is the identification of data sinks (endpoints where data is stored or transmitted) that are adequately secure.
Geographical Data Protection Requirements: Moreover, the data protection regulations can vary significantly depending on the jurisdiction.
Given these complexities, custom rules offer a more flexible and tailored approach for detecting CWE-312 vulnerabilities. In this example, we will employ the C# programming language and leverage the Taint
template within the Custom Rules framework to address these challenges.
Let's start with a simple program:
Building a rule that matches on sensitive data (username
) being sent to a text file is simple enough. A naive first approach is:
This matches a sensitive data flow from ReadLine
to WriteAllText
. Given the broad nature of the rule, it may create quite a lot of noise.
Restricting to specific log files only
The first caveat is that perhaps only testFile.txt
is considered unsafe. Files like cache.txt
should be considered safe.
To achieve this, we use the CallExpression
and HasArg1
templates.
CallExpression
and HasArg1
may also be used on their own. Connecting them using the and
operator establishes the relationship, and Snyk Code will attempt to match them in combination only.
Catching all File writers
In .NET, files can be written using WriteAllText
. There are also WriteAllLines
and WriteAllBytes
.
Our code snippet may be extended this way:
Let's capture the variants first using regular expressions. We will look for the functions and also both filename variants (.bin
and .txt
):
Notice how CallExpression now
contains a regular expression, whereas HasArg1
uses the or
operator. It could be written either way.
Finally, let's add support for the .NETAsync
variants and also the Append
methods:
Defining "interesting data"
In our previous example, we treated ReadLine
as a source of sensitive data. Let's consider a simple object with only specific sensitive fields:
For this class, only EmailAddress
should be considered.
Therefore, given a code snippet like this:
The first call to WriteAllText
should be prevented while the second call is allowed. To accomplish this, you can defer to specifying the field name in the query:
Last updated