ZACHARY KLEIMAN | Kew Gardens, NY | zacharymkleiman@gmail.com | (917) 748-6503

therootuser.dev • GitHub: github.com/Zacharymk1213 • GitLab: gitlab.com/ZacharyKleiman

SUMMARY

Resourceful and self-directed Junior Software Engineer with experience spanning backend engineering, developer tooling, data engineering, ML modeling, digital archiving, system administration, and transpiler/DSL development.

Proven ability to deliver complex projects independently — from scalable web applications and data pipelines to compiler grammars, ML research, and CI/CD automation. Contributor to major open-source ecosystems including ANTLR (official grammars-v4) and GNOME.

Experienced in optimizing infrastructure costs, managing production traffic, building internal tools, and designing machine learning models with clear documentation and academic rigor.

Strong technical writing ability, evidenced by a formal research paper and extensive project documentation. Recognized in multiple performance reviews for contributions that impressed supervisors.

TECHNICAL SKILLS

Languages: Java, Python, JavaScript (Node.js & frontend), TypeScript, SQL, Bash, ANTLR4 (Grammar Language), Rust (introductory; using for Servo work)

Web: HTML, CSS

Databases: PostgreSQL, SQLite

Backend/Systems: Spring Boot, REST APIs, Linux, Nginx, ingestion/indexing pipelines, Express

DevOps & Infra: AWS, DigitalOcean, server provisioning, CI/CD (GitLab CI), cost optimization, log ingestion, AWStats

Machine Learning: scikit-learn, data cleaning, feature engineering, K-fold validation

EDUCATION

WGU | Bachelor of Science in Computer Science | 2022 – 2025
Highlighted Coursework & Certifications: Machine Learning, Data Structures & Algorithms II, AI Optimization Developer micro-credential, Linux Foundations Certification, Axelos ITIL Foundation Certification

EXPERIENCE

Full-Stack Document Archive Platform — rabbimayerschiller.com | Production System, October 2024–Present

Full Stack Developer

  • Architected and deployed a Spring Boot–based document archiving and media platform serving 39,000+ unique visitors, 98,000+ visits, and 430,000+ pageviews annually with more than 200 GB of bandwidth (AWStats logs preserved).
  • Built full ingestion pipeline for hundreds of documents & recordings, including scanning, cleaning, metadata extraction, and search indexing. Implemented backend logic, file delivery system, Nginx configuration, and full-stack integration.
  • Performed end-to-end server migration from $48/month infrastructure to a $12/month compute instance plus a $10/month storage volume, achieving a 54% cost reduction while maintaining performance.

DAR Language Toolchain — dar.js & DarANTLR - Pninim (https://pninim.org/en/download/) | June 2024–Present

Software Engineer

  • Built dar.js, a TypeScript-based transpiler forked from Djot that converts the DAR domain-specific language into HTML; advanced the project to the point where the organization publicly promotes and uses it.
  • Supervisor noted that “the quality of the books is now two levels higher” and that “the editors use it a lot,” highlighting substantial impact on daily editorial workflows.
  • Reimplemented the DAR grammar in JavaScript using ANTLR (DarANTLR) and developed a visitor that outputs HTML, completing the full rewrite in a 2-week sprint and writing 162 unit tests to ensure correctness and stability.

Machine Learning Research Project — National Personalities & Freedom Index | Nov – Dec 2025

Research Engineer / ML Developer

  • Merged and analyzed 1,015,342 Big Five personality records with 3 years of Cato Human Freedom Index data.
  • Designed full cleaning pipeline, including multi-stage imputation (rounds 1 and 2), removal of invalid rows, dataset merging, and normalization.
  • Achieved R² scores of 0.50–0.79 across 8 of 13 political freedom categories — strong by social science standards.
  • Performed K-Fold validation demonstrating stable predictive performance; documented results in a 10-page academic-style paper.

Open Source Contributor | 2023–2025

ANTLR SRT Grammar — Official grammars-v4 Contribution

  • Authored the SRT (subtitle) grammar merged into the ANTLR grammars-v4 repository, the official collection of grammars written for ANTLR v4.

GNOME Text Editor — Windows CI/CD Pipeline Contributions

  • Created a Windows build pipeline for GNOME Text Editor, enabling the application to be successfully built and deployed on Windows for the first time.

Application & Website Developer | July 2024–Present

Developed several desktop utilities and websites for other people, including:

  • File Searcher: Python app developed as part of my job at Community Board 7 that searches the text of Word files in bulk so that poorly named files can be sorted, saving many hours of manual work. Mike, who handles IT for the Community Board, was very impressed by it and approved its use.
  • Burns Depression Index Tracker: Python desktop app that calculates a score on the Burns Depression Index, saves each score with a timestamp (down to the second) in a SQL database, and graphs scores over a user-specified period to show how they have fluctuated. Created because no app existed for this index at the time.
  • Connections: Python desktop app that lets users build tables of their connections and the relevant contact information, so they can log their social circle and network more efficiently.
  • Developed timrosen.com, a promotional site for Prof. Tim Rosen featuring book landing pages (covers, marketing copy, and Amazon “Buy Now” links), article and blog sections, and an author bio.
  • Creating a C API in Rust for servo-gtk, a GTK4 widget for embedding Servo (in progress).