Updates for the Issue #15 by KnightofInd · Pull Request #27 · texasbe2trill/AlertSage

KnightofInd · 2025-12-26T17:25:39Z

Introduced 00_getting_started_tutorial.ipynb as a beginner-friendly, interactive Jupyter notebook covering environment setup, model loading, first prediction, batch analysis, visualizations, and hands-on exercises. Updated docs/notebooks.md to document the new notebook, its features, and revised the recommended reading order for new users, SOC analysts, ML engineers, and contributors.

Description

Please include a summary of the changes and which issue is fixed. Include relevant motivation and context.

Fixes # (issue)

Type of change

Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce.

Test A
Test B

Checklist:

My code follows the style guidelines of this project
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes
Any dependent changes have been merged and published in downstream modules

Introduced 00_getting_started_tutorial.ipynb as a beginner-friendly, interactive Jupyter notebook covering environment setup, model loading, first prediction, batch analysis, visualizations, and hands-on exercises. Updated docs/notebooks.md to document the new notebook, its features, and revised the recommended reading order for new users, SOC analysts, ML engineers, and contributors.

Copilot

Pull request overview

This PR introduces a beginner-friendly tutorial notebook (00_getting_started_tutorial.ipynb) to help new users get started with AlertSage. The tutorial covers environment setup, model loading, predictions, batch analysis, visualizations, and hands-on exercises.

Key Changes:

New interactive Jupyter notebook with 9 sections covering AlertSage fundamentals
Updated documentation in docs/notebooks.md to include the new notebook in the sequence
Revised recommended reading order for different user types (new users, SOC analysts, ML engineers, contributors)

Reviewed changes

Copilot reviewed 1 out of 2 changed files in this pull request and generated 3 comments.

File	Description
notebooks/00_getting_started_tutorial.ipynb	New comprehensive getting started tutorial with setup verification, model loading, single/batch predictions, 4 interactive visualizations, uncertainty analysis, LLM overview, and 3 hands-on exercises
docs/notebooks.md	Updated notebook count from 10 to 11, added section documenting new tutorial features, and revised recommended reading order to start with notebook 00 for all user types

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-12-26T17:45:44Z

notebooks/00_getting_started_tutorial.ipynb

+      "🔍 Sample Incidents Preview:\n",
+      "\n",
+      " event_id      event_type severity                                                                                                                                                                                                                                                                                                                                                                                                     description\n",
+      "   171216    access_abuse   medium                                                                                         Bsaed on current evidence repeated account lockuots for gina.t associated with sign-in attempts frmo unrecognized locations. auth telemetry shwos unusual IPs including 158.173.238.165 and 59.154.26.63, which do not match historical baselines. This pattern aligns with MITRE ATT&CK technique T1110 (Burte Force).\n",


Multiple spelling errors in this sample incident description that appear to be intentional (simulating real-world typos): "Bsaed" should be "Based", "frmo" should be "from", "shwos" should be "shows", "Burte" should be "Brute".

Suggested change

" 171216 access_abuse medium Bsaed on current evidence repeated account lockuots for gina.t associated with sign-in attempts frmo unrecognized locations. auth telemetry shwos unusual IPs including 158.173.238.165 and 59.154.26.63, which do not match historical baselines. This pattern aligns with MITRE ATT&CK technique T1110 (Burte Force).\n",

" 171216 access_abuse medium Based on current evidence repeated account lockuots for gina.t associated with sign-in attempts from unrecognized locations. auth telemetry shows unusual IPs including 158.173.238.165 and 59.154.26.63, which do not match historical baselines. This pattern aligns with MITRE ATT&CK technique T1110 (Brute Force).\n",

Copilot · 2025-12-26T17:45:44Z

notebooks/00_getting_started_tutorial.ipynb

+      "   171216    access_abuse   medium                                                                                         Bsaed on current evidence repeated account lockuots for gina.t associated with sign-in attempts frmo unrecognized locations. auth telemetry shwos unusual IPs including 158.173.238.165 and 59.154.26.63, which do not match historical baselines. This pattern aligns with MITRE ATT&CK technique T1110 (Burte Force).\n",
+      "   420689    access_abuse     info Preliminary analysis indicates repeated failed login attempts for alice.w from 201.48.110.54 (RU), followed by a successful loign outsied normal working huors. The system lokced the account briefly and then allowed access, whcih is consistent with password guessign. This pattenr aligns wtih MITRE ATT&CK technique T1110 (Brute Force). Login lcoation matches known travel but device is unrecognized.\n",
+      "   244999    access_abuse   medium                                                                             First-level review shows repeated account lockouts for leo.v associated wtih sign-in attempts form unrecognized locations. Authentication telemetry shows unusual IPs including 229.254.100.75 and 8.41.173.22,8 which do not match historical baselines. This behavior algins with MITRE ATTC&K technique T1021 (Remtoe Services).\n",
+      "   252969 benign_activity      low                    Prleiminary analysis indiactes taht EDR alert on MACBOOK-SEC-01 is a false psoitive. The flagged process is a legitimate business application communicating wtih approved cloud srevices at 243.65.61.87:22. Security team verified the digital signature and confirmed this is authorized sfotware opertaing normally. Tuning rule 'Monitoring - Capacity threshold alret' to reduce noies.\n",


Multiple spelling errors in this line: "Prleiminary" should be "Preliminary", "indiactes" should be "indicates", "taht" should be "that", "psoitive" should be "positive", "srevices" should be "services", "sfotware" should be "software", "opertaing" should be "operating", "alret" should be "alert".

Suggested change

" 252969 benign_activity low Prleiminary analysis indiactes taht EDR alert on MACBOOK-SEC-01 is a false psoitive. The flagged process is a legitimate business application communicating wtih approved cloud srevices at 243.65.61.87:22. Security team verified the digital signature and confirmed this is authorized sfotware opertaing normally. Tuning rule 'Monitoring - Capacity threshold alret' to reduce noies.\n",

" 252969 benign_activity low Preliminary analysis indicates that EDR alert on MACBOOK-SEC-01 is a false positive. The flagged process is a legitimate business application communicating wtih approved cloud services at 243.65.61.87:22. Security team verified the digital signature and confirmed this is authorized software operating normally. Tuning rule 'Monitoring - Capacity threshold alert' to reduce noies.\n",

Copilot · 2025-12-26T17:45:45Z

notebooks/00_getting_started_tutorial.ipynb

+      "Top 3 Probabilities: suspicious_network_activity:0.52, web_attack:0.33, benign_activity:0.14\n",
+      "\n",
+      "Incident Description (first 200 chars):\n",
+      "Perliminary analysis indicates command-and-control-style traffic form WIN10-LAPTOP-01 (147.6.5.4205) to external infratsructure at 1188.4.105.112:3389 (CN). NetFlow analysis revelas DNS queries to sus...\n",


Multiple spelling errors in the incident description: "Perliminary" should be "Preliminary", "form" should be "from", "infratsructure" should be "infrastructure", "revelas" should be "reveals".

Suggested change

"Perliminary analysis indicates command-and-control-style traffic form WIN10-LAPTOP-01 (147.6.5.4205) to external infratsructure at 1188.4.105.112:3389 (CN). NetFlow analysis revelas DNS queries to sus...\n",

"Preliminary analysis indicates command-and-control-style traffic from WIN10-LAPTOP-01 (147.6.5.4205) to external infrastructure at 1188.4.105.112:3389 (CN). NetFlow analysis reveals DNS queries to sus...\n",

texasbe2trill · 2025-12-26T17:57:22Z

@KnightofInd These spelling errors copilot caught are intended to mimic incident responders entering event descriptions in various ticketing systems. Since this NLP system utilizes a synthetic dataset, without typos or noise injected, this would've caused the confusion matrix to be perfect and the baseline model to inaccurately classify events. I'll review the Jupyter notebook shortly and if it looks good, this PR will be squashed and merged. Thanks again for your help!

texasbe2trill

@KnightofInd I reviewed your changes and fixes to issue #15. The implemented changes are good to squash and merge into main. Thanks again for your support and contribution this was excellent work!

KnightofInd · 2025-12-26T18:17:01Z

@texasbe2trill Thank you for the thorough review and the encouraging feedback. I’m glad the changes fit well with your goals. Appreciate the opportunity to contribute and happy to help again.

texasbe2trill · 2025-12-26T18:19:32Z

@texasbe2trill Thank you for the thorough review and the encouraging feedback. I’m glad the changes fit well with your goals. Appreciate the opportunity to contribute and happy to help again.

@KnightofInd Feel free to pick up another issue or create issues you see fit to enhance or fix. Thanks again for your excellent work and all of your help!

texasbe2trill requested a review from Copilot December 26, 2025 17:42

Copilot started reviewing on behalf of texasbe2trill December 26, 2025 17:43 View session

Copilot AI reviewed Dec 26, 2025

View reviewed changes

texasbe2trill approved these changes Dec 26, 2025

View reviewed changes

texasbe2trill merged commit 4cf567e into texasbe2trill:main Dec 26, 2025
7 checks passed

texasbe2trill added this to AlertSage Jan 3, 2026

texasbe2trill moved this to Backlog in AlertSage Jan 3, 2026

github-project-automation bot moved this from Backlog to Done in AlertSage Jan 3, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Updates for the Issue #15#27

Updates for the Issue #15#27
texasbe2trill merged 1 commit intotexasbe2trill:mainfrom
KnightofInd:main

KnightofInd commented Dec 26, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Dec 26, 2025

Uh oh!

Copilot AI Dec 26, 2025

Uh oh!

Copilot AI Dec 26, 2025

Uh oh!

texasbe2trill commented Dec 26, 2025

Uh oh!

texasbe2trill left a comment

Uh oh!

KnightofInd commented Dec 26, 2025

Uh oh!

texasbe2trill commented Dec 26, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	" 171216 access_abuse medium Bsaed on current evidence repeated account lockuots for gina.t associated with sign-in attempts frmo unrecognized locations. auth telemetry shwos unusual IPs including 158.173.238.165 and 59.154.26.63, which do not match historical baselines. This pattern aligns with MITRE ATT&CK technique T1110 (Burte Force).\n",
	" 171216 access_abuse medium Based on current evidence repeated account lockuots for gina.t associated with sign-in attempts from unrecognized locations. auth telemetry shows unusual IPs including 158.173.238.165 and 59.154.26.63, which do not match historical baselines. This pattern aligns with MITRE ATT&CK technique T1110 (Brute Force).\n",

	"Perliminary analysis indicates command-and-control-style traffic form WIN10-LAPTOP-01 (147.6.5.4205) to external infratsructure at 1188.4.105.112:3389 (CN). NetFlow analysis revelas DNS queries to sus...\n",
	"Preliminary analysis indicates command-and-control-style traffic from WIN10-LAPTOP-01 (147.6.5.4205) to external infrastructure at 1188.4.105.112:3389 (CN). NetFlow analysis reveals DNS queries to sus...\n",

Uh oh!

Conversation

KnightofInd commented Dec 26, 2025

Description

Type of change

How Has This Been Tested?

Checklist:

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Key Changes:

Reviewed changes

Uh oh!

Copilot AI Dec 26, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 26, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 26, 2025

Choose a reason for hiding this comment

Uh oh!

texasbe2trill commented Dec 26, 2025

Uh oh!

texasbe2trill left a comment

Choose a reason for hiding this comment

Uh oh!

KnightofInd commented Dec 26, 2025

Uh oh!

texasbe2trill commented Dec 26, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants