Experienced Data Architect – Large-Scale Data Pipeline Engineering, Cloud Infrastructure & Business Intelligence Solutions

Posted 2026-05-05
Remote, USA · Full-time · Immediate Start

Are you ready to join one of the world's most innovative technology teams and help shape the future of data-driven logistics? arenaflex is seeking talented Data Architects to play a pivotal role in our Dataworks division, where you'll be at the forefront of building and maintaining enterprise-scale data infrastructure that powers global shipping and logistics operations.

At arenaflex, we believe that data is the backbone of modern logistics. Every package tracking update, route optimization calculation, and delivery prediction model relies on robust data pipelines engineered by talented professionals like you. Join us and become part of a team that processes billions of data points daily, enabling seamless delivery experiences for millions of customers worldwide.

About the Dataworks Division

The Dataworks division at arenaflex serves as the intellectual hub for data engineering and architectural innovation within our organization. We are the creative force behind the platforms that transform raw data into actionable business intelligence, driving strategic decisions across every level of our enterprise. Our mission is to create scalable, efficient, and reliable data solutions that not only meet current business needs but anticipate future challenges in the rapidly evolving logistics industry.

As a Data Architect at arenaflex, you will function as a "universal translator," bridging the gap between IT infrastructure teams, business stakeholders, software developers, and data scientists. This interdisciplinary role offers unique exposure to multiple facets of our organization, providing a holistic understanding of how data powers global logistics operations. You'll collaborate with teams of varying backgrounds and expertise levels, contributing to projects that have an immediate and measurable impact on our business operations.

Key Responsibilities

Your day-to-day responsibilities will span the full spectrum of data architecture and engineering, requiring both technical excellence and strategic thinking. Here's what you can expect:


  • Deep Business and Technical Understanding: Develop comprehensive knowledge of both the business challenges that Dataworks aims to solve and the technical architectures required to address them. You'll need to understand the "why" behind each data initiative and translate that into robust technical solutions.

  • Pipeline and Platform Development: Design, build, test, and maintain data pipelines and platforms that enable teams to analyze data efficiently, build predictive models, and drive data-informed decision making. You'll work on systems that scale from individual workstation processing to enterprise-wide distributed computing environments.

  • Scalability Engineering: Scale solutions from "computer scale" to "cluster scale," adapting both the infrastructure architecture and the problem-solving methods so they can handle the massive data volumes characteristic of global logistics operations.

  • Cross-Functional Collaboration: Partner with teams across the organization to generate data-driven operational insights that translate into high-value process improvements. You'll work alongside professionals from various disciplines, bringing together diverse perspectives to solve complex challenges.

  • Rapid Value Delivery: Deliver tangible results quickly while collaborating with teams of varying backgrounds and technical specialties. Balance speed of delivery with quality and long-term maintainability.

  • Best Practice Documentation: Document and classify best practices for future reuse, creating templates, patterns, and codebases that other teams can leverage. Your contributions will extend beyond immediate projects to benefit the entire data engineering community at arenaflex.

  • External Technology Engagement: Engage with senior technologists from across the enterprise and external partner environments to create synergies and ensure smooth integration with downstream operational systems.

  • Code Review and Architecture: Participate actively in code reviews and large-scale architectural discussions, ensuring that solutions meet both technical standards and business requirements.

  • Environment Maintenance: Keep the data environment and pipelines upgraded, optimized, and running efficiently. Conduct regular performance investigations and troubleshoot data-related issues, including providing Level 3 support for complex problems.

  • Data Transformation: Implement parsers, validators, transformers, and correlators to reformat, update, and enrich data throughout the pipeline lifecycle.

  • Mentorship: Provide guidance and mentorship to more junior team members, helping develop the next generation of data professionals at arenaflex.

Required Qualifications

We seek candidates with strong technical foundations and proven experience in data engineering or related fields. While we value formal education, we also recognize that real-world experience often provides equally valuable skills.

Educational Background


  • Bachelor's degree in Computer Science, Information Systems, Data Science, Mathematics, Engineering, or a related quantitative discipline

  • Advanced degrees (Master's or PhD) in relevant fields may substitute for some experience requirements

Technical Expertise


  • Solid foundation in software engineering, database systems, and distributed systems architecture

  • Familiarity with distributed and cloud computing environments, with a deep understanding of how to optimize computational performance

  • Experience building robust cloud-based data engineering and curation solutions that make data accessible and useful across multiple applications

  • Strong proficiency with Microsoft Azure tooling for large-scale data engineering projects, including familiarity with Azure Databricks, Azure Data Factory, Azure SQL Database, and Azure Synapse Analytics

  • Experience developing and operationalizing capabilities for near-real-time high-volume streaming scenarios

  • Active development skills, with the ability to work at the code level and troubleshoot complex technical issues

  • Demonstrated history of designing and delivering large-scale technical solutions that provide ongoing, measurable value

  • Direct experience building and deploying complex production systems that implement modern data processing methodologies at scale

  • Ability to context-switch effectively and support distributed teams, including functioning as a "rescue programmer" to overcome challenging technical obstacles

  • Strong problem-solving skills, with the ability to work through undefined and evolving challenges

  • Demonstrated ability to lead teams on technical projects, often under tight deadlines, to deliver value

  • An "engineering mindset," with the ability to make rapid, practical decisions to improve performance, accelerate progress, or maximize impact

  • Comfort working with distributed teams on code-based deliverables, using version control systems and participating in code reviews

  • Ability to lead data analysis, profiling, and lineage studies to document and improve data quality and access

  • Experience with Agile and DevOps practices for project and software management, including continuous integration and continuous deployment (CI/CD)

Programming Languages and Tools

Proficiency with some of the following languages and tools is highly valued:


  • Apache Spark (Scala and PySpark), Kafka, and other high-volume data processing tools

  • SQL and NoSQL storage solutions such as MySQL, PostgreSQL, MongoDB, or Cosmos DB

  • Java and Python for data tool development

  • Azure DevOps experience, including work tracking, Git-based version control patterns, and building CI/CD pipelines

  • Understanding of data engineering patterns to support varying business requirements

  • Experience with multiple data formats (JSON, XML, Parquet, Avro, unstructured data) for both batch and streaming ingestion

  • Azure Kubernetes Service, Event Hubs, or related technologies for implementing streaming ingestion

  • Experience developing and implementing alerting and monitoring strategies

  • Working knowledge of Infrastructure as Code (IaC) through Terraform for creating and deploying resources

  • Execution experience across different data stores, messaging systems, and data processing engines

  • Data integration through APIs and REST services

  • Power Platform (Power BI, Power Apps, Power Automate) development experience is a plus

Experience Levels and Requirements

Data Architect I



  • Bachelor's degree in a relevant field plus one year of equivalent training or work experience

  • Basic knowledge of data engineering and AI frameworks including design, development, and implementation of complex systems and data pipelines

  • Basic knowledge of Data Systems including design, development, and implementation of large batch or online transaction-based systems

  • Experience as a junior member of multi-functional project teams

  • Strong verbal and written communication skills

Data Architect II



  • Bachelor's degree plus two years of relevant work experience in measurement and analysis, quantitative business problem-solving, simulation development, or predictive analytics

  • Strong knowledge of data engineering and AI frameworks including design, development, and implementation of highly complex systems and data pipelines

  • Strong knowledge of Data Systems including design, development, and implementation of large batch or online transaction-based systems

  • Solid understanding of the logistics industry, competitors, and emerging technologies

  • Experience as a member of multi-functional project teams

Data Architect III



  • Bachelor's degree plus three to four years of relevant work experience

  • Extensive knowledge of data engineering and AI frameworks including design, development, and implementation of highly complex systems and data pipelines

  • Extensive knowledge of Data Systems including design, development, and implementation of large batch or online transaction-based systems

  • Strong understanding of the logistics industry, competitors, and emerging technologies

  • Experience providing leadership in training or consulting settings

  • Experience as a senior member of multi-functional project teams

Data Architect Lead



  • Bachelor's degree plus five to seven years of relevant work experience

  • Extensive knowledge of data engineering and AI frameworks including design, development, and implementation of highly complex systems and data pipelines

  • Extensive knowledge of Data Systems including design, development, and implementation of large batch or online transaction-based systems

  • Strong understanding of the logistics industry, competitors, and emerging technologies

  • Experience providing leadership in training or consulting settings

  • Experience as a leader or senior member of multi-functional project teams

Work Environment and Culture

At arenaflex, we foster a collaborative, innovation-driven culture that values technical excellence, continuous learning, and mutual respect. Our Dataworks teams work in an environment that encourages experimentation and creative problem-solving. We believe that the best solutions emerge when diverse perspectives come together, and we actively promote inclusive practices throughout our organization.

You'll have access to cutting-edge technologies and tools, with opportunities to work on challenging problems that have real-world impact. Our investment in professional development means you'll continuously grow your skills through training programs, certifications, and exposure to emerging technologies in the data engineering space.

Compensation and Benefits

We offer competitive compensation packages that reflect the value our Data Architects bring to our organization. The salary range for these positions varies based on experience, location, and specific role level. In addition to competitive base salaries, arenaflex provides comprehensive benefits including health insurance, retirement plans, paid time off, and professional development opportunities.

For positions in Colorado, Nevada, Connecticut, New York, California, Rhode Island, Washington, Hawaii, Illinois, and New Jersey, the monthly salary range is approximately $6,317.00 to $15,477.00. Actual compensation will be determined based on factors including experience, qualifications, and location.

Location and Travel

This position can be domiciled anywhere in the United States, offering flexibility for candidates across the country. We support remote work arrangements and have the infrastructure to enable distributed teams to collaborate effectively.

How to Apply

If you're ready to take the next step in your data engineering career and join a team that's transforming the logistics industry, we encourage you to apply. Please submit your current resume (in Microsoft Word or PDF format) along with your responses to our work screening questions.

At arenaflex, we value diversity and are committed to creating an inclusive environment for all employees. We encourage candidates from all backgrounds to apply and join us in building the future of global logistics.
