Civic Transparency Simulation Toolkit

A foundational toolkit for generating synthetic transparency data and calculating metrics for research, testing, and education.

Overview

This package provides the essential building blocks for transparency research without revealing detection methods or assessment criteria. It enables researchers and educators to:

Generate controlled datasets with reproducible seeds
Calculate standard transparency metrics
Build reproducible analysis and teaching pipelines
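
To illustrate what a reproducible seed buys you, here is a minimal sketch that does not use the package itself. The helper `generate_window_counts` and its parameters are hypothetical and exist only for this example; the package's own generator may work differently.

```python
import random


def generate_window_counts(seed, n_windows=6, mean_events=20):
    """Return synthetic per-window event counts.

    Hypothetical helper for illustration only; not part of the package API.
    """
    rng = random.Random(seed)  # local RNG, so global random state is untouched
    # Draw a noisy count per window and clamp at zero.
    return [max(0, round(rng.gauss(mean_events, 5))) for _ in range(n_windows)]


run_a = generate_window_counts(seed=42)
run_b = generate_window_counts(seed=42)

# The same seed yields the same dataset, so an analysis can be rerun exactly.
print(run_a == run_b)  # True
```

Passing the seed in explicitly, rather than seeding the global generator, keeps independent datasets from interfering with each other.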

Key Features

Standardized Data Types: Core structures for temporal events, content fingerprints, and aggregated metrics that enable reproducible research across different groups.
Synthetic Data Generation: Create realistic datasets with organic activity patterns, content clustering, and temporal dynamics. Generate both baseline and influenced scenarios for A/B comparisons.
Standard Metrics: Calculate metrics including duplicate rates, hash concentration (Herfindahl index), burst detection, and content type distributions.
Database Integration: Export data to JSONL and load it into DuckDB for SQL-based analysis and visualization.
Cross-Platform CLI: Command-line interface for data generation and conversion workflows.
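
As a rough illustration of two of the metrics named above, the sketch below computes a duplicate rate and a Herfindahl index over a list of content hashes. It assumes that "duplicate rate" means the share of items that repeat an already-seen hash, and that the Herfindahl index is the sum of squared hash shares. The function names are hypothetical, and the package's own definitions may differ.

```python
from collections import Counter


def duplicate_rate(hashes):
    """Share of items whose hash repeats one already seen (assumed definition)."""
    if not hashes:
        return 0.0
    return 1.0 - len(set(hashes)) / len(hashes)


def herfindahl_index(hashes):
    """Sum of squared shares per distinct hash; 1.0 means a single hash dominates."""
    if not hashes:
        return 0.0
    total = len(hashes)
    return sum((count / total) ** 2 for count in Counter(hashes).values())


sample = ["a", "a", "a", "b", "c"]
print(duplicate_rate(sample))    # about 0.4  (1 - 3 distinct / 5 items)
print(herfindahl_index(sample))  # about 0.44 (0.6^2 + 0.2^2 + 0.2^2)
```

A higher Herfindahl value indicates that activity is concentrated in fewer distinct pieces of content.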

Quick Start

Install the package:

pip install civic-transparency-sdk

Generate synthetic data:

# Generate baseline world
ct-sim generate --world A --topic-id baseline --out world_A.jsonl

# Convert JSONL to DuckDB
ct-sim convert --jsonl world_A.jsonl --duck world_A.duckdb --schema schema/schema.sql
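
Before converting, it can help to take a quick look at the generated JSONL. The sketch below uses only the Python standard library and makes no assumption about the record schema. The helper `summarize_jsonl` is hypothetical, not part of the package; it simply counts records and top-level keys.

```python
import json
from collections import Counter
from pathlib import Path


def summarize_jsonl(path):
    """Return (record count, {top-level key: occurrences}) for a JSONL file."""
    n_records = 0
    key_counts = Counter()
    with Path(path).open(encoding="utf-8") as fh:
        for line in fh:
            line = line.strip()
            if not line:
                continue  # tolerate blank lines
            record = json.loads(line)
            n_records += 1
            if isinstance(record, dict):
                key_counts.update(record.keys())
    return n_records, dict(key_counts)


# Only runs if the file from the generate step is present.
if Path("world_A.jsonl").exists():
    count, keys = summarize_jsonl("world_A.jsonl")
    print(count, keys)
```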

Use Cases

Academic Research: Generate controlled datasets for studying information dynamics
Education: Provide realistic datasets for analysis exercises and metric calculation practice
Algorithm Development: Create test datasets with known ground truth for tool development
Benchmarking: Use standard metrics and data formats to enable cross-group comparisons

Security Model

This package provides building blocks for transparency research. It does not include:

Detection algorithms or thresholds
Verification workflows or assessment criteria
Operational rules or alerting logic

These are deliberately kept out of this package to prevent adversarial misuse while still enabling legitimate transparency research.

Documentation Index

Civic Transparency PTag Spec
Civic Transparency PTag Types
Civic Transparency Verify (research)