add local Codex skill for performance-focused Python-to-C translation
define modular C architecture and benchmark/correctness gates
add references for patterns, profiling, and module design
add scaffold_c_module.py to generate an include/, src/, tests/, bench/ skeleton
update agent default prompt for benchmark-backed optimizations