docs: Add 'Customizing library models for Rust' documentation by coadaflorin · Pull Request #21727 · github/codeql

coadaflorin · 2026-04-17T09:04:24Z

Summary

Adds a new documentation page: Customizing library models for Rust, following the pattern of existing documentation for other languages:

What's included

The documentation covers Rust-specific concepts:

Extensible predicates: sourceModel, sinkModel, summaryModel, neutralModel with Rust's simplified 3-5 column schema (vs Java/Go's 9-10 column schema)
Canonical paths: How Rust identifies callables using fully-qualified paths (crate::module::function, <Type>::method, <Type as Trait>::method, <_ as Trait>::method)
Rust-specific access path tokens: Reference (for &T), Future (for async), Field with Rust enum variant syntax
Examples using real models from the codebase:
- SQL injection sink with sqlx
- Remote source from reqwest::get
- Environment variable source from std::env::var
- Flow summary through reqwest::Response::text (async)
- Flow summary through std::path::Path::join (multiple inputs)
- Flow summary through Iterator::map (higher-order, wildcard trait)
- Neutral model for Option::map
Reference sections for predicates, access paths, source/sink/summary kinds, and threat models

Changes

New file: docs/codeql/codeql-language-guides/customizing-library-models-for-rust.rst
Modified: docs/codeql/codeql-language-guides/codeql-for-rust.rst — added toctree entry and description

Add documentation for customizing library models for Rust using data extension files. This follows the pattern of existing documentation for other languages (Java, Python, Ruby, Go, C#, C++, JavaScript). The documentation covers: - Rust-specific extensible predicates (sourceModel, sinkModel, summaryModel, neutralModel) with their simplified schema - Canonical path syntax for identifying Rust functions and methods - Examples using real models from the codebase (sqlx, reqwest, std::env, std::path, Iterator::map) - Access path token reference (Argument, Parameter, ReturnValue, Element, Field, Reference, Future) - Source and sink kind reference - Threat model integration Also updates codeql-for-rust.rst to include the new page in the toctree. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Add barrierModel and barrierGuardModel sections to the Rust library models documentation, following the pattern established in PR #21523 for other languages. Includes: - New extensible predicate descriptions in the overview - Example: barrier for SQL injection using escape_sql - Example: barrier guard for path injection using is_safe_path - Reference material for both barrierModel and barrierGuardModel Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

geoffw0

Partially reviewed. I need to continue from "Examples of custom model definitions", then check final rendering and links. We will also want a docs team review at some point.

geoffw0 · 2026-04-17T12:37:20Z

+- **Free functions**: ``crate::module::function``, for example ``std::env::var`` or ``std::fs::read_to_string``.
+- **Inherent methods**: ``<Type>::method``, for example ``<std::fs::File>::open``.
+- **Trait methods with a concrete type**: ``<Type as Trait>::method``, for example ``<std::fs::File as std::io::Read>::read_to_end``.
+- **Trait methods with a wildcard type**: ``<_ as Trait>::method``, for example ``<_ as core::clone::Clone>::clone``. This form matches any type that implements the trait and is useful for modeling broadly applicable trait methods.


I don't see this section in the doc for other languages, I think Copilot may have synthesised it entirely ... but it looks really helpful, and as far as I can tell, correct.

As someone with no familiarity with rust, it looks helpful to me. (Assuming it's correct.)

…for-rust.rst Co-authored-by: Geoffrey White <40627776+geoffw0@users.noreply.github.com>

Co-authored-by: Geoffrey White <40627776+geoffw0@users.noreply.github.com>

Add the 'Publish data extension files in a CodeQL model pack to share' section, matching the structure used in C#, C++, Go, and Java docs. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

coadaflorin · 2026-04-20T14:27:07Z

@geoffw0 if this looks good to you, I'll ask the doc teams if they can take a quick look and then I'lll try to see if I can merge at a similar time to @owen-mc so we have the docs up for all languages that can use barriers

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Adds a new Rust language guide page describing how to write CodeQL data extensions for Rust library modeling, and wires it into the Rust docs index.

Changes:

Added a new documentation page describing Rust-specific modeling concepts (canonical paths, access paths, and extensible predicates).
Added the new page to the Rust language guide toctree and link list.

Show a summary per file

File	Description
docs/codeql/codeql-language-guides/customizing-library-models-for-rust.rst	New guide page explaining how to create Rust library models with data extensions.
docs/codeql/codeql-language-guides/codeql-for-rust.rst	Adds the new guide page to the Rust documentation navigation.

Copilot's findings

Files reviewed: 2/2 changed files
Comments generated: 2

Copilot · 2026-04-20T14:29:13Z

+It would also be possible to merge the two rows into one by using a comma-separated list in the second value:
+
+.. code-block:: yaml
+
+  extensions:
+    - addsTo:
+        pack: codeql/rust-all
+        extensible: summaryModel
+      data:
+        - ["<std::path::Path>::join", "Argument[self,0]", "ReturnValue", "taint", "manual"]
+
+This row defines flow from both the receiver and the first argument to the return value. The second value ``Argument[self,0]`` is shorthand for specifying an access path to both ``Argument[self]`` and ``Argument[0]``.
+
+.. note::
+
+  When using ``Argument[self]`` to refer to the receiver, the ``Reference`` token may need to be appended to follow through the ``&self`` or ``&mut self`` reference to the underlying value. This depends on whether the data you want to track is on the reference itself or on the value behind the reference.


This merged example changes the meaning compared to the two-row version above. In the two-row version, the receiver flow is modeled from Argument[self].Reference (and the text states this is needed because join takes &self), but the merged form drops .Reference entirely. Either remove this merged example, or adjust it so it preserves the same access-path semantics (and clarify any limitations if the shorthand can’t express per-operand tokens).

Suggested change

It would also be possible to merge the two rows into one by using a comma-separated list in the second value:

.. code-block:: yaml

extensions:

- addsTo:

pack: codeql/rust-all

extensible: summaryModel

data:

- ["<std::path::Path>::join", "Argument[self,0]", "ReturnValue", "taint", "manual"]

This row defines flow from both the receiver and the first argument to the return value. The second value ``Argument[self,0]`` is shorthand for specifying an access path to both ``Argument[self]`` and ``Argument[0]``.

.. note::

When using ``Argument[self]`` to refer to the receiver, the ``Reference`` token may need to be appended to follow through the ``&self`` or ``&mut self`` reference to the underlying value. This depends on whether the data you want to track is on the reference itself or on the value behind the reference.

In this case, the two rows should not be merged into one by using a comma-separated list in the second value.

The receiver flow is modeled as ``Argument[self].Reference``, while the first argument is modeled as

``Argument[0]``. Since these access paths are different, keeping them as separate rows preserves the

correct semantics for ``Path::join``.

.. note::

When using ``Argument[self]`` to refer to the receiver, the ``Reference`` token may need to be appended to follow through the ``&self`` or ``&mut self`` reference to the underlying value. This depends on whether the data you want to track is on the reference itself or on the value behind the reference. If different operands require different access-path tokens, model them using separate rows instead of a single comma-separated shorthand.

Copilot · 2026-04-20T14:29:14Z

+- **Field[**\ ``type::field``\ **]** selects a named field of a struct or enum variant. For example, ``Field[core::option::Option::Some(0)]`` selects the first positional field of the ``Some`` variant.
+- **Field[**\ ``type(i)``\ **]** selects the ``i``-th positional field of a tuple struct or tuple enum variant. For example, ``Field[core::result::Result::Ok(0)]`` selects the value inside ``Ok``.


The description of Field[...] is internally inconsistent: the first bullet claims type::field and then gives an example that is not a named field (it’s a variant + positional index), and the second bullet describes type(i) but the example uses type::Variant(i). Please align the prose with the actual expected syntax (and update the examples accordingly) so readers can reliably construct correct Field[...] access paths.

Suggested change

- **Field[**\ ``type::field``\ **]** selects a named field of a struct or enum variant. For example, ``Field[core::option::Option::Some(0)]`` selects the first positional field of the ``Some`` variant.

- **Field[**\ ``type(i)``\ **]** selects the ``i``-th positional field of a tuple struct or tuple enum variant. For example, ``Field[core::result::Result::Ok(0)]`` selects the value inside ``Ok``.

- **Field[**\ ``type::field``\ **]** selects a named field of a struct or enum variant. For example, ``Field[my_crate::Config::path]`` selects the named field ``path``.

- **Field[**\ ``type(i)``\ **]** selects the ``i``-th positional field of a tuple struct, and **Field[**\ ``type::Variant(i)``\ **]** selects the ``i``-th positional field of a tuple enum variant. For example, ``Field[core::result::Result::Ok(0)]`` selects the value inside ``Ok``.

github-actions bot added the documentation label Apr 17, 2026

geoffw0 reviewed Apr 17, 2026

View reviewed changes

geoffw0 mentioned this pull request Apr 17, 2026

Document models-as-data barriers and barrier guards and add change notes #21523

Open

coadaflorin and others added 3 commits April 17, 2026 15:11

Update docs/codeql/codeql-language-guides/customizing-library-models-…

73695db

…for-rust.rst Co-authored-by: Geoffrey White <40627776+geoffw0@users.noreply.github.com>

Update docs/codeql/codeql-language-guides/codeql-for-rust.rst

7c9dd05

Co-authored-by: Geoffrey White <40627776+geoffw0@users.noreply.github.com>

Add model pack publishing section to Rust docs

1c8b90e

Add the 'Publish data extension files in a CodeQL model pack to share' section, matching the structure used in C#, C++, Go, and Java docs. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

coadaflorin marked this pull request as ready for review April 20, 2026 14:25

coadaflorin requested a review from a team as a code owner April 20, 2026 14:25

Copilot AI review requested due to automatic review settings April 20, 2026 14:25

Copilot started reviewing on behalf of coadaflorin April 20, 2026 14:26 View session

Copilot AI reviewed Apr 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: Add 'Customizing library models for Rust' documentation#21727

docs: Add 'Customizing library models for Rust' documentation#21727
coadaflorin wants to merge 5 commits intomainfrom
docs/customizing-library-models-for-rust

coadaflorin commented Apr 17, 2026

Uh oh!

geoffw0 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

geoffw0 Apr 17, 2026

Uh oh!

owen-mc Apr 17, 2026

Uh oh!

coadaflorin commented Apr 20, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Apr 20, 2026

Uh oh!

Copilot AI Apr 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		- Field[\ ``type::field``\ ] selects a named field of a struct or enum variant. For example, ``Field[core::option::Option::Some(0)]`` selects the first positional field of the ``Some`` variant.
		- Field[\ ``type(i)``\ ] selects the ``i``-th positional field of a tuple struct or tuple enum variant. For example, ``Field[core::result::Result::Ok(0)]`` selects the value inside ``Ok``.

Conversation

coadaflorin commented Apr 17, 2026

Summary

What's included

Changes

Uh oh!

geoffw0 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

geoffw0 Apr 17, 2026

Choose a reason for hiding this comment

Uh oh!

owen-mc Apr 17, 2026

Choose a reason for hiding this comment

Uh oh!

coadaflorin commented Apr 20, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Copilot's findings

Uh oh!

Copilot AI Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants