PAUL IACOBUCCI

CS student @ Cornell, 3x SWE/ML Intern @ L3Harris.

I've:
- solo-engineered a platform backed by Cornell for 45,000+ alumni.
- shipped full-stack + embedded applications for a major defense contractor.
- worked on AI infrastructure for industry clients through the Zhang Research Group.

Timeline: By taking the maximum credits every semester, I've skipped two semesters to save my family tuition. Graduating May or Dec 2027, depending on spring/fall internships.

Objective: SWE/ML/AI infra internships for Fall 2026 and Spring/Summer 2027; full-time roles in Summer/Winter 2027.

Recruiters and entrepreneurs, please reach me at pmi22@cornell.edu

Experience

Cornell Zhang Research Group
Research Assistant: SP26 - Profiled Mixture-of-Experts inference on 8×H100 GPUs for an industry client.
L3Harris Technologies
Software Engineering Intern: Three internships across software, embedded, and FPGA. Returning summer 2026 for ML inference on Qualcomm SoCs.
RapStudy
Software Engineering Intern: Full-stack development on a DoED-backed EdTech platform with a 3-engineer Cornell team.

Projects

Scout (Backed by Cornell)
A platform that helps Cornell student-athletes network with 45,000+ Cornell athletic alumni and track those relationships. Solo full-stack build (Next.js + Expo/React Native) on a multi-tenant Postgres backend, shipped to 200+ active users.

View Site
Lion AI Detection Suite
Trained a CNN on librosa features and ElevenLabs deepfakes; deployed via ONNX to mobile, Chrome extension, and desktop. Real-time audio capture + sliding-window inference + user alerts, shipped to 20+ users. Built with PyTorch and React Native.

View Site
HFT Mixture-of-Experts FPGA Engine
An FPGA trading pipeline that runs end-to-end in 444 ns at 83.3M messages/sec. Register-partitioned limit order book, sparse MoE router pipelined to one trade per cycle, bit-exact RTL/C++ verification in Verilator.

View GitHub
Mini-TensorRT: DL Graph Compiler
A C++ deep-learning graph compiler with a CUDA backend. Conv-ReLU-Add fusion cuts DRAM traffic for a 25% end-to-end speedup; paged-attention KV cache doubles batch concurrency.

View GitHub
Triton GPU Performance Kernels
Fused Triton kernels on H100. LayerNorm runs 45.7% faster with symmetric FP8 quantization. Scaled FlashAttention to 16K context by tiling for SRAM and computing softmax online in one pass.

View GitHub
Digital Level & Impact Monitor
An interrupt-driven tilt sensor and impact monitor on the FRDM-KL46Z (Cortex-M0+). Sleeps in __WFI between PIT timer wakeups; ARM assembly for the trig math; I2C accelerometer reads and UART alerting on impact.

View GitHub

Hackathons

Point72 Cubist Hackathon
Built an AI-orchestrated modular chess engine evaluation system. Used Claude via an MCP server to autonomously test, benchmark, and compare diverse AI-generated chess engines using SPRT and perft.

View GitHub
UC Berkeley AI Hackathon
Vocera: Biometric authentication and synthetic voice detection system built with FastAPI, SpeechBrain, and OpenAI Whisper.

View GitHub
AppDev Hack Challenge FA24
LockedIn: Professional networking application. Awarded Best UI.

View GitHub

Outside Work

Lifting: chasing a 315 lb (3 plate) bench. Current PR: 305 lb.
Golf: chasing a single-digit handicap. Current handicap: ~12.
Travel: I've unfortunately been stuck in upstate NY for a while.
One Piece: caught up, now on the rewatch grind.
E&J: I help my cousin run a real estate business on the side. Check it out.