Senior AI Engineer
Firemind
Apr 2025 – Present
London, UK
- Leading the development of Pulse, a scalable single-tenant product focused on real-time analytics and intelligent automation for enterprise use cases.
- Building intelligent agents and custom workflows for clients using Python and JavaScript, enabling tailored automation and decision-support systems.
- Managing and scaling infrastructure on AWS, ensuring high availability, cost efficiency, and seamless deployment pipelines.
- Fine-tuning large language models (LLMs) using Amazon SageMaker to align performance with domain-specific requirements.
- Optimising inference performance using vLLM, TGI (Text Generation Inference), and NVIDIA NIM to reduce latency and improve throughput.
- Mentoring junior engineers and contributing to team-wide best practices in LLMOps and production ML systems.

