Frequently Asked Questions

General Questions

What is CRML?

CRML (Cyber Risk Modeling Language) is an open, declarative language for expressing cyber risk models. It allows you to:

Model cyber risks using statistical distributions
Run Monte Carlo simulations
Calculate risk metrics (EAL, VaR)
Make data-driven security decisions

Think of it as "YAML for cyber risk" - simple, readable, and powerful.

How accurate are CRML simulations?

CRML simulations are as accurate as the parameters you provide. The Monte Carlo method itself is mathematically sound and widely used in finance and insurance.

Accuracy depends on: - Quality of input parameters (lambda, mu, sigma) - Number of simulation runs (more = more accurate) - How well your model represents reality

Best practices: - Use industry data (Verizon DBIR, IBM reports) - Calibrate from your historical incidents - Start conservative, refine over time - Run sensitivity analysis

Do I need to be a statistician to use CRML?

No! CRML is designed for security professionals, not statisticians.

You need to understand: - Basic probability (what's a 10% chance?) - Your organization's risks - How to read industry reports

You don't need: - Advanced mathematics - Statistical software experience - Programming skills (though Python API is available)

The Writing CRML guide explains everything step-by-step.

How does CRML compare to other risk tools?

Feature	CRML	FAIR-U	RiskLens	Spreadsheets
Open Source	✅	❌	❌	✅
Cost	Free	Paid	Paid	Free
Learning Curve	Low	Medium	Medium	Low
Flexibility	High	Medium	Low	High
Statistical Rigor	High	High	High	Varies
Automation	✅	Limited	✅	Limited

CRML advantages: - Free and open source - Simple YAML syntax - Extensible and programmable - Active development

Technical Questions

How do I choose between Poisson and Gamma for frequency?

Use Poisson when: - Events are random and independent - You know the probability per asset - Examples: phishing, malware, random attacks

Use Gamma when: - Frequency varies significantly - Events come in clusters - Examples: attack campaigns, seasonal patterns

Rule of thumb: Start with Poisson. It's simpler and works for 90% of cases.

How do I choose between Lognormal and Gamma for severity?

Use Lognormal when: - Most losses are small, but some are huge - Distribution is right-skewed - Examples: data breaches, ransomware, most cyber losses

Use Gamma when: - Losses are more symmetric - You have specific shape/scale parameters - Less common in cyber risk

Rule of thumb: Use Lognormal for cyber risk. It matches real-world loss distributions.

What if I don't have historical data?

You have several options:

1. Use industry data: - Verizon DBIR (Data Breach Investigations Report) - IBM Cost of Data Breach Report - Sophos State of Ransomware - Ponemon Institute studies

2. Expert estimates: - Ask your security team - Use conservative estimates - Document assumptions

3. Start simple:

# Conservative phishing model
frequency:
  lambda: 0.05  # 5% (lower than industry average)
severity:
  mu: 10.0      # ~$22K (modest loss)
  sigma: 1.0    # Moderate variability

4. Calibrate over time: - Start with estimates - Track actual incidents - Refine parameters quarterly

How many simulation runs should I use?

Quick testing: 1,000 runs (fast, less accurate)
Standard analysis: 10,000 runs (recommended)
High precision: 100,000 runs (slow, very accurate)

Rule of thumb: Use 10,000 runs. It's the sweet spot between speed and accuracy.

What does "mu" mean in lognormal distribution?

mu is the natural logarithm of the median loss.

Quick formula:

median_loss = e^mu
mu = ln(median_loss)

Examples: - mu = 10.0 → ~ $22K median loss - mu = 11.5 → ~$ 100K median loss - mu = 13.5 → ~$700K median loss

Tip: Use an online calculator or: python -c "import math; print(math.exp(11.5))"

Can I model multiple risk scenarios in one file?

Not directly in a single CRML model, but you can:

Option 1: Separate files

phishing-risk.yaml
ransomware-risk.yaml
data-breach-risk.yaml

Option 2: Aggregate in code

from crml import CRMLModel

phishing = CRMLModel("phishing.yaml")
ransomware = CRMLModel("ransomware.yaml")

total_risk = phishing.eal + ransomware.eal

Option 3: Portfolio model (coming soon) - Model organization-wide risk - Account for correlations - Planned for CRML 2.0

Practical Questions

How do I present CRML results to executives?

Focus on three numbers:

EAL (Expected Annual Loss) - "We expect to lose $X per year on average"
VaR 95% - "In 95% of years, losses will be below $Y"
VaR 99% - "Only 1 in 100 years will exceed $Z"

Example executive summary:

Phishing Risk Assessment

Expected Annual Loss: $220,000
- Budget this amount for phishing-related costs

95% Confidence Level: $450,000
- Losses will likely stay below this

Worst-Case Scenario (99%): $650,000
- Ensure insurance covers at least this amount

Recommendation: Invest $50K in MFA to reduce EAL by 60%

Tip: Use the web platform's charts for visual impact!

Can I use CRML for compliance (SOC 2, ISO 27001)?

Yes! CRML helps with:

SOC 2: - Risk assessment documentation - Quantitative risk analysis - Control effectiveness measurement

ISO 27001: - Annex A risk treatment - Risk assessment methodology - Continuous monitoring

PCI DSS: - Annual risk assessment - Compensating controls analysis

Tip: Export results to JSON and include in compliance documentation.

How do I justify security budget with CRML?

1. Calculate current risk:

crml simulate current-state.yaml
# EAL: $500K

2. Model with proposed control:

# Reduce lambda by 70% with MFA
frequency:
  lambda: 0.03  # Down from 0.10

3. Calculate risk reduction:

crml simulate with-mfa.yaml
# EAL: $150K

4. Show ROI:

Risk Reduction: $500K - $150K = $350K/year
Control Cost: $50K/year
ROI: 7x return on investment
Payback Period: 2 months

Can I integrate CRML with my SIEM/GRC tool?

Yes! CRML has several integration options:

1. JSON output:

crml simulate model.yaml --format json > results.json

2. Python API:

from crml import CRMLModel

model = CRMLModel("model.yaml")
results = model.simulate(runs=10000)

# Send to your GRC platform
grc_api.upload_risk_assessment(results)

3. REST API (via web platform):

curl -X POST http://localhost:3000/api/simulate \
  -H "Content-Type: application/json" \
  -d @model.yaml

4. Scheduled updates:

# Cron job to update risk dashboard
0 0 * * * crml simulate model.yaml --format json | \
  curl -X POST https://dashboard.example.com/api/risk

Troubleshooting

My simulation results seem wrong

Check these common issues:

Lambda too high/low:
Lambda = 0.10 means 10% per asset per year
Not 10 events per year!
Mu confusion:
Mu is ln(median), not the median itself
Use: mu = ln(desired_median_loss)
Cardinality:
Set to number of assets at risk
Not total employees/servers
Units:
All losses in dollars
All frequencies per year

Debug tip: Start with known examples from the docs and modify incrementally.

Validation errors

Common YAML errors:

# ❌ Wrong: tabs instead of spaces
model:
    frequency:  # Tab character

# ✅ Correct: spaces
model:
  frequency:  # 2 spaces

# ❌ Wrong: missing quotes
meta:
  name: my-model  # Should be quoted if contains hyphens

# ✅ Correct:
meta:
  name: "my-model"

Use the validator:

crml validate model.yaml

Performance issues

Simulation too slow?

Reduce runs temporarily:
```
crml simulate model.yaml --runs 1000
```
Check cardinality:
Very large cardinality (>100,000) can be slow
Consider portfolio-level modeling instead
Use JSON output:
Faster than formatted text
```
crml simulate model.yaml --format json
```

Advanced Questions

Can I model correlated risks?

Not yet in CRML 1.1, but it's on the roadmap!

Current workaround:

# Use Python to combine models with correlation
import numpy as np
from crml import CRMLModel

model1 = CRMLModel("ransomware.yaml")
model2 = CRMLModel("data-breach.yaml")

# Apply correlation factor
correlation = 0.7
combined_loss = model1.simulate() + correlation * model2.simulate()

Coming in CRML 2.0:

correlations:
  - events: ["ransomware", "data_breach"]
    correlation: 0.7

Can I model time-varying risk?

Not directly, but you can:

1. Model different time periods:

# Q4 (higher risk)
frequency:
  lambda: 0.15  # 50% increase

2. Run multiple scenarios:

crml simulate q1-model.yaml
crml simulate q2-model.yaml
crml simulate q3-model.yaml
crml simulate q4-model.yaml

3. Use Python for trends:

# Model increasing trend
lambdas = [0.05, 0.06, 0.07, 0.08]  # Growing risk
for i, lam in enumerate(lambdas):
    model.update_parameter("frequency.lambda", lam)
    results = model.simulate()
    print(f"Q{i+1} EAL: ${results.eal}")

How do I model security controls?

Method 1: Adjust parameters

# Before MFA
frequency:
  lambda: 0.10  # 10% phishing success

# After MFA (85% reduction)
frequency:
  lambda: 0.015  # 1.5% success rate

Method 2: Separate models

baseline-risk.yaml      # No controls
with-mfa.yaml          # With MFA
with-mfa-and-edr.yaml  # Multiple controls

Method 3: Python calculation

baseline = CRMLModel("baseline.yaml")
control_effectiveness = 0.85  # 85% reduction

reduced_lambda = baseline.lambda * (1 - control_effectiveness)

Still Have Questions?

Can't find your answer? Open a GitHub issue and we'll add it to this FAQ!