Docs/Disclaimer

Disclaimer

Important information about Cadence's capabilities, limitations, and responsible use.

Important information about Cadence's capabilities, limitations, and responsible use.

What Cadence Is

Cadence is a detection tool designed to assist in identifying AI-generated content through pattern analysis. It examines:

  • Git repositories - Analyzes commit patterns, velocity, timing, and code structure
  • Websites - Scans content for patterns common in AI-generated text

Cadence provides probabilistic estimates to flag suspicious content for human review.

Critical Limitations

No Perfect Accuracy

Cadence cannot guarantee 100% accuracy. AI detection is inherently challenging because:

  • AI is always evolving - New models produce different outputs
  • Human writing varies - Some humans write like AI naturally
  • Context matters - Technical writing often uses formulaic language
  • Adversarial techniques - Prompt engineering can evade detection

False Positives

Cadence may flag human-written content as AI-generated when:

  • Code is auto-generated but legitimate (build tools, generators, migrations)
  • Writing follows formulaic patterns (technical docs, boilerplate, templates)
  • Non-native English speakers exhibit different writing patterns
  • Large refactors or bulk commits occur (legitimate developer activity)
  • Boilerplate code is reused across modules

Examples:

  • Lock files (package-lock.json) are technically auto-generated but legitimate
  • Technical documentation often uses repetitive structure
  • Large commits during refactoring aren't necessarily suspicious
  • Consistent code style across a codebase isn't inherently problematic

False Negatives

Cadence may miss AI-generated content when:

  • AI models improve and create more "human-like" output
  • Content is heavily edited after generation
  • New AI generation techniques emerge
  • Prompt engineering produces less detectable patterns
  • Models are fine-tuned on human-written content

Examples:

  • AI-generated code that's been hand-edited by humans
  • New models not yet seen in training data
  • Sophisticated prompting that produces legitimate-looking output
  • AI used for legitimate assistance (not malicious intent)

Confidence Scores

What they mean:

  • Confidence scores are probabilistic estimates, not certainties
  • Higher scores suggest higher likelihood of AI-generated content
  • They are not proof of AI generation

How to interpret:

  • 0.0-0.3 - Likely human-written
  • 0.3-0.6 - Uncertain, manual review recommended
  • 0.6-0.8 - Likely AI-generated, warrants investigation
  • 0.8-1.0 - Very likely AI-generated, but verify manually

Never rely on scores alone for definitive conclusions.

What Cadence Is NOT

Cadence is not:

  • ❌ A definitive proof that content is AI-generated
  • ❌ A replacement for human judgment and review
  • ❌ A tool for making accusations without investigation
  • ❌ Legally binding verification
  • ❌ Suitable for punitive action without human review
  • ❌ An endorsement that flagged content is malicious

Good Uses

  • Initial screening of large codebases or content libraries
  • Flagging suspicious content for human review
  • Trend analysis across repositories over time
  • Awareness of potential AI-assisted contributions
  • Educational purposes to understand AI patterns
  • Voluntary internal monitoring of your own teams
  • Security testing to identify potential vulnerabilities

Problematic Uses

  • Automatic rejection of flagged content without review
  • Public accusations based on Cadence results alone
  • Disciplinary action without human investigation
  • Mass surveillance of developers or writers
  • Punitive systems that penalize flagged content
  • Discrimination based on Cadence scores
  • Replacing human expertise with automated tools

Ethical Considerations

Use Cadence responsibly:

1. Don't Make Accusations

  • Never publicly claim code is AI-generated based solely on Cadence
  • Always follow up with human review and investigation
  • Consider context - AI assistance isn't inherently wrong

2. Investigate Thoroughly

Before taking action based on Cadence results:

  • Review the actual code or content
  • Consider alternative explanations
  • Look for nuance and context
  • Talk to the author if possible
  • Consult domain experts

3. Respect Privacy and Laws

  • Respect privacy when analyzing content
  • Follow applicable laws in your jurisdiction (GDPR, CCPA, etc.)
  • Get permission before analyzing others' content
  • Communicate transparently about how/why you use detection tools

4. Consider Context

AI assistance isn't always bad:

  • Some legitimate uses exist (code generation, refactoring suggestions)
  • Different contexts have different appropriateness
  • Blanket policies may harm legitimate users
  • Nuance and judgment are required

5. Be Transparent

If using Cadence in professional settings:

  • Tell people you're using detection tools
  • Explain limitations transparently
  • Describe your process for verification
  • Allow appeals and human review
  • Avoid surprise accusations

Known Constraints

Supported Languages

  • Git analysis - Works with any programming language
  • Web analysis - Currently optimized for English

Repository Size

  • Works with repositories of any size
  • May be slow with extremely large repos (100,000+ commits)
  • Recommend analyzing specific branches for very large repos

Webhook Limitations

  • Job timeout: 5 minutes per analysis
  • Concurrent workers: Configurable (default 4)
  • Job queue: 100 pending jobs

Contributing to Improvement

Cadence is open source. You can help improve it:

  • Report false positives/negatives - GitHub Issues
  • Submit improvements - Pull Requests
  • Share feedback - Discussions
  • Test detection - Help identify accuracy issues
  • Star the repository - Show support for the project

Ongoing Development

Cadence is actively maintained and improved:

  • Detection refinement - Continuously improving pattern recognition
  • New strategies - Adding detection methods as AI evolves
  • Bug fixes - Addressing issues as discovered
  • Performance - Optimizing speed and accuracy
  • Documentation - Keeping guides current

No Warranty

Cadence is provided "as is" without any warranty, express or implied.

No Liability

The authors are not liable for:

  • Damages from using this tool
  • Consequences of Cadence's results
  • False positives or false negatives
  • Decisions made based on Cadence output
  • Any other issues arising from use

License

Cadence is licensed under AGPL-3.0. See LICENSE for details.

Questions?

Remember: Cadence is a tool to assist human judgment, not replace it. Always use good judgment and conduct proper investigation before taking action based on Cadence results.