What files were downloaded using a web browser?

UUID: 8620a183-d67f-481e-a63c-d8b8dfa5e968 ID: Q1001

Approaches to Answer

Collect download records from local browser artifacts [Q1001.10]
- Parse download records from web browsers' own databases.
- Tags: Web Browser SQLite Chrome Safari Edge
Detect browser downloads via file system event logs [Q1001.11]
- File downloads by some web browsers create a specific pattern of events on the file system. We can use this to see browser downloads using file system logs (like Santa).
- Tags: Web Browser macOS
Detect browser downloads via change journal records [Q1001.12]
- File downloads by some web browsers create a specific pattern of events on the file system. We can use this to see browser downloads using NTFS change journal (USN journal) records.
- Tags: Web Browser Windows USN Journal NTFS

Approaches

Note

There are often multiple ways to approach answering a question. This section explains the approaches considered, how they work, and any benefits or drawbacks to each. All these approaches have different pros and cons. They can be used individually or in conjunction.

Collect download records from local browser artifacts

🗂️ Explanation

Parse download records from web browsers' own databases.

Most web browsers track file downloads (at least in non-Incognito/private sessions) in local artifacts on the system. We can collect and review these records. This approach is useful when you have remote access to a system (via GRR for example) or a disk image.

References

Chrome, Firefox, and Safari downloads on ForensicsWiki
Web Browsers on ForensicArtifacts

📝 Notes

Each approach comes with certain caveats or limitations. These can often be tacit knowledge or assumed that "everyone knows that" (even though they don't). This can lead to incorrect assumptions and analysis. Explicitly stating what is covered and not covered by the approach reduces ambiguity.

Covered

Chrome downloads. Beyond "stable" Chrome, this also includes Chromium, Chrome Canary, Chrome Beta, and other Chromium-based browsers (Microsoft Edge, Brave, and Opera) on Windows, macOS, and Linux
Safari downloads
Includes downloads from all Chrome profiles

Not Covered

Firefox downloads
Downloads on any other browsers
Browsers installed in non-standard paths
Downloads made during Incognito sessions

💾 Data

The following data source(s) are needed for this approach to the question.

Description: Collect local browser history artifacts. These are often in the form of SQLite databases and JSON files in multiple directories.
Type: ForensicArtifact
Value: BrowserHistory (view on GitHub)

⚙️ Processors

A processor is what takes the data collected and processes it in some way to produce structured data for an investigator to review. Multiple processor options can be defined, as there are often multiple programs capable of doing similar processing. Plaso is an example of a processor (it processes raw artifacts into a timeline). After the data is processed, additional analysis steps may be needed to answer the question.

The following processors can process the raw data specified above. Explicit instructions on how to run the processor are not included here, but any relevant configuration options are.

PlasoHindsight

More information on Plaso.

Recommended options:

 --parsers webhist

📊 Analysis

After processing the raw data, further analysis steps are necessary to answer the question. After loading Plaso's output into one of these analysis platforms, use the following steps to refine the data to answer the question.

OpenSearch

Filter the results to just file downloads

OpenSearch query:

data_type:("chrome:history:file_downloaded" OR "safari:downloads:entry")

Python Notebook

Filter the results to just file downloads

Python code:

df.query('data_type in ("chrome:history:file_downloaded", "safari:downloads:entry")')

More information on Hindsight.

Recommended options:

 --format jsonl

📊 Analysis

After processing the raw data, further analysis steps are necessary to answer the question. After loading Hindsight's output into one of these analysis platforms, use the following steps to refine the data to answer the question.

OpenSearch

Filter the results to just file downloads. Hindsight only supports Chromium-based browsers (including Microsoft Edge) and not Firefox or Safari.

OpenSearch query:

data_type:"chrome:history:file_downloaded"

Python Notebook

Filter the results to just file downloads. Hindsight only supports Chromium-based browsers (including Microsoft Edge) and not Firefox or Safari.

Python code:

df.query('data_type == "chrome:history:file_downloaded"')

Detect browser downloads via file system event logs

🗂️ Explanation

File downloads by some web browsers create a specific pattern of events on the file system. We can use this to see browser downloads using file system logs (like Santa).

Chrome downloads on macOS create a specific pattern of file system events:

Download starts with a WRITE event to a temp file named like .com.google.Chrome.avXx0U (the suffix looks to be 6 letters/numbers).
That .com.google.Chrome... temp file might be RENAMEd to Unconfirmed [6 digits].crdownload.
If [2] Unconfirmed...crdownload RENAME didn't happen, the .com.google. temp file is renamed to the eventual filename (with extension) with .crdownload appended at the end (like cat.jpg.crdownload). In either case ([2] or [3]), at this point we have a file ending in .crdownload.
Once the download completes, the .crdownload suffix is removed via a RENAME event, leaving the downloaded file with its final name.

Since [4] above captures the key information about the file download, we can search file system logs for it.

📝 Notes

Each approach comes with certain caveats or limitations. These can often be tacit knowledge or assumed that "everyone knows that" (even though they don't). This can lead to incorrect assumptions and analysis. Explicitly stating what is covered and not covered by the approach reduces ambiguity.

Covered

Chrome downloads on macOS (given that Santa was installed and logging enabled)

Not Covered

All other browsers
Downloads from other Chromium/Chrome-based browsers might be collected via this method, but testing is necessary to confirm.
Downloads that occurred on macOS with Santa, but during a time period for which there are no Santa logs (out of retention or Santa was disabled).

💾 Data

The following data source(s) are needed for this approach to the question.

Description: Santa logs stored on the local disk; they may also be centralized off-system, but this artifact does not include those.
Type: ForensicArtifact
Value: SantaLogs (view on GitHub)

⚙️ Processors

A processor is what takes the data collected and processes it in some way to produce structured data for an investigator to review. Multiple processor options can be defined, as there are often multiple programs capable of doing similar processing. Plaso is an example of a processor (it processes raw artifacts into a timeline). After the data is processed, additional analysis steps may be needed to answer the question.

The following processors can process the raw data specified above. Explicit instructions on how to run the processor are not included here, but any relevant configuration options are.

Plaso

More information on Plaso.

Recommended options:

 --parsers santa

📊 Analysis

After processing the raw data, further analysis steps are necessary to answer the question. After loading Plaso's output into one of these analysis platforms, use the following steps to refine the data to answer the question.

OpenSearch

Filter the results to file system rename events where the original file name ended with .crdownload.

OpenSearch query:

data_type:"santa:file_system_event" action:RENAME file_path:"*\.crdownload"

Detect browser downloads via change journal records

🗂️ Explanation

File downloads by some web browsers create a specific pattern of events on the file system. We can use this to see browser downloads using NTFS change journal (USN journal) records.

Chrome downloads on Windows create a specific pattern of events on the file system:

**Download starts with a FILE CREATE event to .tmp file; older Chrome versions have been observed creating four character file names (like 7DEE.tmp) while newer versions use file names consistent with Version 4 UUIDs (like 183011a0-f96c-4c98-b71b-ab8754fb0e0e.tmp).
That .tmp file might be RENAMEd to Unconfirmed [6 digits].crdownload.
If [2] Unconfirmed...crdownload RENAME didn't happen, the .tmp file is renamed to the eventual filename (with extension) with .crdownload appended at the end (like cat.jpg.crdownload). In either case ([2] or [3]), at this point we have a file ending in .crdownload.
Once the download completes, the .crdownload suffix is removed via a USN_REASON_RENAME_OLD_NAME event, leaving the downloaded file with its final name.

Since [4] above captures the key information about the file download, we can search change journal records for it.

References

Change Journals

📝 Notes

Each approach comes with certain caveats or limitations. These can often be tacit knowledge or assumed that "everyone knows that" (even though they don't). This can lead to incorrect assumptions and analysis. Explicitly stating what is covered and not covered by the approach reduces ambiguity.

Covered

Chrome downloads to an NTFS volume

Not Covered

All other browsers.
Downloads from other Chromium/Chrome-based browsers might be collected via this method, but testing is necessary to confirm.
Downloads on file systems other than NTFS.
Downloads that would be covered, but happened long enough ago that the USN Journal records that would show it have been deleted.

💾 Data

The following data source(s) are needed for this approach to the question.

Description: The NTFS $UsnJnrl file system metadata file. This ForensicArtifact definition does not include the $J alternate data stream, but many tools collect it anyway.
Type: ForensicArtifact
Value: NTFSUSNJournal (view on GitHub)

⚙️ Processors

A processor is what takes the data collected and processes it in some way to produce structured data for an investigator to review. Multiple processor options can be defined, as there are often multiple programs capable of doing similar processing. Plaso is an example of a processor (it processes raw artifacts into a timeline). After the data is processed, additional analysis steps may be needed to answer the question.

The following processors can process the raw data specified above. Explicit instructions on how to run the processor are not included here, but any relevant configuration options are.

Plaso

More information on Plaso.

Recommended options:

 --parsers usnjrnl

📊 Analysis

After processing the raw data, further analysis steps are necessary to answer the question. After loading Plaso's output into one of these analysis platforms, use the following steps to refine the data to answer the question.

OpenSearch

Filter the results to file system rename events where the original file name ended with .crdownload.

OpenSearch query:

data_type:"fs:ntfs:usn_change" filename:crdownload "USN_REASON_RENAME_OLD_NAME"

Select and search for the file_reference value for an event of interest from the previous query. There should be one with the same timestamp as your previous event and its filename value is the download's final name.

OpenSearch query:

data_type:"fs:ntfs:usn_change" {file_reference value} "USN_REASON_RENAME_NEW_NAME"