Senior Data Engineer
About the Position
As a Senior Data Engineer, you’ll play a key role in building and scaling the data infrastructure that powers our AI-driven platform. You’ll be responsible for designing, implementing, and optimizing reliable and scalable data pipelines that process large volumes of structured and unstructured data, from synthetic LLM prompts to large-scale web-scraped datasets, across a growing AWS-based data ecosystem.
This role is focused on enabling rapid scale. Our data volume and traffic are increasing quickly as we expand to new AI channels and data sources, and we need robust, production-grade data systems that can keep pace with that growth. You’ll work closely with engineering, product, and go-to-market teams to ensure data is reliable, observable, and reusable across the organization.
A core part of the role will be shaping the evolution of our data platform, including contributing to the design and implementation of our Data Lake architecture. You’ll help ensure our pipelines can handle increasing load, maintain high data quality, and support new product capabilities as we scale.
You’ll also act as a trusted technical partner across teams, helping establish data best practices, improving operational reliability, and enabling teams to use data effectively in both product and business contexts.
This role is remote in Germany.
What You’ll Be Doing
- Design, build, and maintain scalable data pipelines that ingest, transform, and validate large volumes of data across multiple sources and channels.
- Improve the scalability, reliability, and performance of our data pipelines to support rapidly growing workloads and new data streams.
- Contribute to the design and implementation of our Data Lake architecture, enabling reliable data storage and reuse across teams.
- Manage and optimize data ingestion workflows, including data collected from web scrapers, third-party vendors, and internal systems.
- Monitor pipeline health, investigate incidents, and implement improvements to increase system reliability and observability.
- Support the onboarding and integration of new AI channels and data sources into the platform.
- Collaborate with teams across the organization to ensure data generated by different systems can be reused effectively for analytics and business intelligence.
- Identify and resolve performance bottlenecks in distributed systems, including rate limiting, concurrency, and throughput constraints.
- Advise engineering and product teams on data architecture, data quality, and best practices for managing scalable data workflows.
- Continuously evaluate and improve our data platform to support the company’s rapid growth and evolving product needs.
Qualifications
- Strong experience building and operating scalable data pipelines in production environments.
- Hands-on experience working with Data Lakes or Data Warehouses (e.g., AWS Athena or similar technologies).
- Experience with data transformation and modeling.
- Strong experience working with AWS.
- Experience using Infrastructure-as-Code tools to manage cloud infrastructure.
- Proficiency in Python for data processing and automation.
- Experience working with distributed systems and managing large-scale data workflows.
- Experience implementing monitoring, observability, and incident response practices for data systems.
Nice to have:
- Experience working with large-scale web scraping or external data ingestion systems.
- Experience supporting systems with rapidly increasing traffic or data volume.
About Bluefish:
Bluefish believes that AI represents the next major chapter of the internet – and that consumers will increasingly use AI to consume information and media online. On this new AI internet, brands will need new tools and technologies to tell their stories to consumers online – and a new marketing ecosystem will be created around AI. Bluefish is building the platform that helps brands engage consumers on this new AI channel, with powerful enterprise tools to manage AI brand safety and engage consumers with thoughtful and personalized AI marketing experiences.
The Bluefish team is a tight-knit group of mar-tech industry veterans who previously helped build foundational ad-tech platforms now owned by Meta and Microsoft. Bluefish, the leading AI marketing platform for the Fortune 500, recently announced a $43 million Series B growth financing co-led by Threshold Ventures and NEA with participation from multiple enterprise investors including Amex Ventures, TIAA Ventures, Salesforce Ventures, and continued participation from Bloomberg Beta. We are a globally distributed team, with business operations based in New York City and engineering based in Europe and New York.
Why Bluefish & Our Values:
- Unique opportunity to join on the ground floor of a fast-moving startup building at the center of AI
- Tackle challenging and abstract problems while disrupting the $300BN legacy mar-tech industry
- Join an experienced high-performing team where you will have immediate ownership and impact
- Experience a true meritocracy with significant career growth upside as the business scales
Our Values:
- Demonstrated grit and resourcefulness - you find creative solutions, adapt quickly, and aren’t afraid to roll up your sleeves in a fast-paced, evolving environment.
- Thoughtful and strategic thinker with the ability to anticipate needs, ask the right questions, and deliver insights that truly drive value.
- Proven ability to collaborate across functions and roles, fostering open communication and shared ownership of outcomes.
- Commitment to craftsmanship - you take pride in delivering high-quality work, paying attention to the small details while keeping the big picture in mind.
- Self-starter with a proactive mindset and a strong sense of accountability - able to drive projects forward independently while staying aligned with team goals.
- Comfortable navigating ambiguity and learning new tools, processes, or technologies to get the job done.
Create a Job Alert
Interested in building your career at Bluefish AI? Get future opportunities sent straight to your email.
Apply for this job
*
indicates a required field
