Skip to content

Claim 4 (Comparing Languages)

If I have an engine that can scan and compare codebases across time and language, independent of compilation, I don't know about you, but I want to take it for a spin! What patterns will emerge as we compare repositories of Python, COBOL, and TypeScript at a massive scale?

This is my current validation and proof-of-principle dataset. I have scaled the pipeline up to analyze over 14,000 repositories encompassing 2.57 million files across 50+ languages. The engine scores them according to GitGalaxy's physical risk exposure metrics and structural DNA, and I am presenting it publicly for validation and criticism. Openness and public input are the only ways forward with a system like this.

The following ridge line plots visualize the distribution of these risk exposures and other metrics across the language ecosystem. I currently have the security measures set to be highly sensitive, meaning normal (but risky) coding practices are being listed as security warnings. The sensitivity of these detections can be tuned down for a less annoying daily workflow, and then cranked back to --paranoid for your final sanity check before pushing a release live.

Click on any image to view it at full resolution.

Architectural Risk Exposures

Ridge plot comparing different programming languages based on Cognitive Load Exposure
Cognitive Load Exposure
Ridge plot comparing different programming languages based on Tech Debt Exposure
Tech Debt Exposure
Ridge plot comparing different programming languages based on Error & Exception Exposure
Error & Exception Exposure
Ridge plot comparing different programming languages based on API Exposure
API Exposure
Ridge plot comparing different programming languages based on State Flux Exposure
State Flux Exposure
Ridge plot comparing different programming languages based on Concurrency Exposure
Concurrency Exposure
Ridge plot comparing different programming languages based on Testing Exposure
Testing Exposure
Ridge plot comparing different programming languages based on Documentation Exposure
Documentation Exposure
Ridge plot comparing different programming languages based on Specification Exposure
Specification Exposure
Ridge plot comparing different programming languages based on Graveyard Exposure
Graveyard Exposure
Ridge plot comparing different programming languages based on Civil War Formatting Exposure
Civil War Exposure

Structural Physics & DNA

Ridge plot comparing different programming languages based on Structural Mass
Structural Mass
Ridge plot comparing different programming languages based on Control Flow Ratio
Control Flow Ratio
Ridge plot comparing different programming languages based on Control Flow Branches
Control Flow Branches
Ridge plot comparing different programming languages based on Max Function Complexity
Max Function Complexity
Ridge plot comparing different programming languages based on Avg Function Arguments
Avg Function Arguments
Ridge plot comparing different programming languages based on Outbound Imports
Outbound Imports
Ridge plot comparing different programming languages based on I/O & Network Boundaries
I/O & Network Boundaries

Volatility & Authorship

Ridge plot comparing different programming languages based on Volatility Exposure (Churn)
Volatility Exposure (Churn)
Ridge plot comparing different programming languages based on Instability Exposure (Age)
Instability Exposure (Age)
Ridge plot comparing different programming languages based on Silo Risk (Bus Factor)
Silo Risk (Bus Factor)
Ridge plot comparing different programming languages based on Ownership Entropy
Ownership Entropy

Core Security & Vulnerability Risks

Ridge plot comparing different programming languages based on Hardcoded Payload Artifacts
Hardcoded Payload Artifacts
Ridge plot comparing different programming languages based on Obfuscation & Evasion Surface
Obfuscation & Evasion Surface
Ridge plot comparing different programming languages based on Exploit Generation Surface
Exploit Generation Surface
Ridge plot comparing different programming languages based on Weaponizable Injection Vectors
Weaponizable Injection Vectors
Ridge plot comparing different programming languages based on Raw Memory Manipulation
Raw Memory Manipulation




🌌 Powered by the blAST Engine

This documentation is part of the GitGalaxy Ecosystem, an AST-free, LLM-free heuristic knowledge graph engine.