IBM: Scalable and Efficient Large-Scale Log Analysis with LLMs

Takeaway

  • Core pattern: Drain clustering first, then LLM classification on representatives only - this is the right approach for LAPP too
  • The three label types are highly reusable for LAPP:
    • Golden Signal Classification (error / availability / latency / saturation / information)
    • Fault Category Prediction (application / network / I/O / etc.)
    • Named Entity Recognition (host, session ID, error code, etc.)
  • Label Broadcasting = the same “control plane / data plane” split as LILAC’s cache, just at a different granularity
  • Edge case (3.2%): template variables themselves carry diagnostic cues - LAPP needs to handle this
  • Engineering-oriented: runs on CPU (BERTOps, a small fine-tuned BERT rather than a GPT-class model), deployed across 70 IBM products for 15 months
  • Report generation - important for LAPP product design:
    • Summary Report: representative lines sorted by rarity (rarest = most important), filterable by label
    • Temporal Trend: golden signal counts over time; answers “when did it start breaking?”
    • Causal Graph: Granger causality on cluster time series; answers “how did the fault propagate?”
    • Diagnosis Report: only time windows containing faults, chronological, searchable by entity; answers “what exactly happened at that time?”
    • Workflow: Summary (overview) → Temporal Trend (locate time) → Causal Graph (understand causality) → Diagnosis Report (deep dive)
  • 425K lines → 74 representative lines (99.9% reduction) in a real case study
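The core pattern (cluster first, classify representatives only, broadcast labels) can be sketched as below. This is a heavily simplified stand-in for Drain (token-count bucketing plus position-wise similarity, no parse tree), and `classify_with_llm` is a hypothetical keyword stub where the real system would call BERTOps; names and thresholds are my assumptions, not the paper's.

```python
def similarity(a, b):
    """Position-wise token match ratio for equal-length token lists (Drain-style)."""
    same = sum(1 for x, y in zip(a, b) if x == y)
    return same / len(a)

def cluster_logs(lines, threshold=0.5):
    """Greedy simplified Drain: lines join a cluster with the same token
    count and enough matching tokens; differing positions become <*>."""
    clusters = []  # each: {"rep": template tokens, "lines": member lines}
    for line in lines:
        toks = line.split()
        for c in clusters:
            if len(c["rep"]) == len(toks) and similarity(c["rep"], toks) >= threshold:
                c["lines"].append(line)
                c["rep"] = [x if x == y else "<*>" for x, y in zip(c["rep"], toks)]
                break
        else:
            clusters.append({"rep": toks, "lines": [line]})
    return clusters

def classify_with_llm(template):
    """Hypothetical stub standing in for the BERTOps golden-signal classifier."""
    if "error" in template.lower():
        return "error"
    if "timeout" in template.lower():
        return "latency"
    return "information"

def label_broadcast(lines):
    """One classifier call per cluster; the label is broadcast to all members."""
    clusters = cluster_logs(lines)
    labeled = {}
    for c in clusters:
        label = classify_with_llm(" ".join(c["rep"]))
        for line in c["lines"]:
            labeled[line] = label
    return clusters, labeled
```

With 425K lines collapsing to ~74 clusters, the expensive model runs ~74 times instead of 425K - the same control-plane/data-plane split as LILAC's cache.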
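The Summary Report and Temporal Trend bullets reduce to a few lines each; the data shapes here (a `cluster_sizes` dict, `(timestamp, label)` event pairs) and the 60-second window are illustrative assumptions, not the paper's schema.

```python
from collections import Counter

def summary_report(cluster_sizes):
    """cluster_sizes: {representative_line: occurrence_count}.
    Rarest templates first, per the paper's heuristic that rare
    lines are the most diagnostically important."""
    return sorted(cluster_sizes.items(), key=lambda kv: kv[1])

def temporal_trend(events, window=60):
    """events: iterable of (epoch_seconds, golden_signal_label).
    Buckets label counts per time window, so a spike in 'error'
    counts answers 'when did it start breaking?'."""
    buckets = {}
    for ts, label in events:
        start = ts - ts % window
        buckets.setdefault(start, Counter())[label] += 1
    return buckets
```

Filtering the summary by label (e.g. only `error` clusters) then follows directly from the label-broadcast output.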
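For the Causal Graph, a proper Granger test (F-test on restricted vs. unrestricted autoregressions) needs a stats library; as a dependency-free sketch of the same idea, the lag-correlation proxy below draws an edge u → v when u's past per-window counts track v's present counts. The lag and threshold are arbitrary assumptions.

```python
def corr(a, b):
    """Pearson correlation of two equal-length sequences."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a) ** 0.5
    vb = sum((y - mb) ** 2 for y in b) ** 0.5
    return cov / (va * vb) if va and vb else 0.0

def causal_edges(series, lag=1, threshold=0.8):
    """series: {cluster_id: [count per time window]}.
    Edge u -> v when u's counts at t-lag correlate strongly with
    v's counts at t - a crude stand-in for the Granger test."""
    edges = []
    for u, xu in series.items():
        for v, xv in series.items():
            if u != v and corr(xu[:-lag], xv[lag:]) >= threshold:
                edges.append((u, v))
    return edges
```

The resulting directed edges over cluster time series are what let the report answer “how did the fault propagate?”.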