PodMorph

Cross-Platform Media Transformation for Social Platforms

Overview

PodMorph, was a colaboration between Captain Mcfunshines Software Engineering, Overcoat media and independant Project leader and independant designer in 2024. I led the development of a cross-platform application that transforms podcasts and radio programs into various media formats optimized for social media platforms. Using advanced AI integration, the system automates content repurposing while preserving brand integrity and key messaging across all outputs. The value proposition of this app was to allow journalists and sound engineers to spend more time on their jobs by automating the peripheral tasks.

PodMorph Platform Overview

Technical Achievements

Software Architecture & Cross-Platform Development

As Lead Architect, I designed and implemented a comprehensive cross-platform solution:

  • Architected and implemented a comprehensive Kotlin Multiplatform (KMP) solution delivering native applications for 7 platforms (iOS, Android, Windows, macOS on both x64 and ARM, Linux on both x64 and ARM)
  • We colabarativly designed and implemented the UI using Compose Multiplatform, enabling consistent interfaces across all supported platforms
  • Established a robust Model-View-Intent (MVI) architecture for predictable state management and UI interactions
  • Implemented client-server architecture to control AI interaction and balance processing requirements between local devices and cloud resources
  • Integrated multiple AI services with fallback mechanisms to ensure service continuity
Software Architecture Software Architecture

Cloud Infrastructure & API Integration

I developed a robust cloud infrastructure to handle complex media processing:

  • Designed cloud-based processing workflow using Azure Container Instances and AWS S3 for data storage
  • Implemented PostgreSQL database management with data models for user accounts, projects, payments, and content artifacts
  • Orchestrated multiple AI services including Groq, Anthropic Claude, OpenAI, Fal.ai, Whisper, and Google Vertex AI with fallback mechanisms
  • Built optimized media processing pipelines for transcription, summarization, and text, image and video content generation
  • Created secure API communication using Ktor with proper authentication and error handling
Cloud Infrastructure

Media Processing & Transformation

The heart of PodMorph lies in its sophisticated media processing capabilities:

  • Developed advanced text-to-image generation workflows using multiple AI models (DALL-E, Flux, Idiogram) for diverse creative outputs
  • Created sophisticated audio/video processing capabilities using FFMPEG for precise clip extraction and multi-layer composition
  • Implemented intelligent text overlay systems with dynamic sizing and multi-layer outlining for readability
  • Built systems to automatically synchronize text animations with audio using segment timings
  • Integrated natural language processing for entity recognition, sentiment analysis, and quote extraction

Key Contributions

  • Responsible for architcture and development of the product
  • Created a complete AI processing pipeline from audio transcription to fully-rendered social media content
  • Implemented intelligent content transformation with multiple specialized outputs (Twitter threads, Instagram Reels, LinkedIn posts, etc.)
  • Developed sophisticated content verification algorithms to ensure AI-generated content matches source material
  • Built a consistent cross-platform user experience that feels native on each supported operating system
  • Designed intelligent fallback mechanisms throughout the system to ensure reliability during API outages

Technologies & Skills

  • Kotlin Multiplatform (KMP) for cross-platform development
  • Compose Multiplatform for UI framework
  • Model-View-Intent (MVI) architecture pattern
  • Azure Container Instances and AWS S3 for cloud infrastructure
  • PostgreSQL database and Exposed ORM
  • AI service integration (Groq, Anthropic Claude, OpenAI, Google Vertex)
  • FFMPEG for media manipulation
  • Ktor for networking and API communication
  • HikariCP for database connection pooling
  • CI/CD with GitHub Actions
  • Docker containerization

Transforming audio content into engaging social media experiences through AI-powered automation