We are looking for an experienced Lead Data Engineer to oversee the design, implementation, and management of advanced data infrastructure in Houston, Texas. This role requires expertise in architecting scalable solutions, optimizing data pipelines, and ensuring data quality to support analytics, machine learning, and real-time processing. The ideal candidate will have a deep understanding of Lakehouse architecture and Medallion design principles to deliver robust and governed data solutions.<br><br>Responsibilities:<br>• Develop and implement scalable data pipelines to ingest, process, and store large datasets using tools such as Apache Spark, Hadoop, and Kafka.<br>• Utilize cloud platforms like AWS or Azure to manage data storage and processing, leveraging services such as S3, Lambda, and Azure Data Lake.<br>• Design and operationalize data architecture following Medallion patterns to ensure data usability and quality across Bronze, Silver, and Gold layers.<br>• Build and optimize data models and storage solutions, including Databricks Lakehouses, to support analytical and operational needs.<br>• Automate data workflows using tools like Apache Airflow and Fivetran to streamline integration and improve efficiency.<br>• Lead initiatives to establish best practices in data management, facilitating knowledge sharing and collaboration across technical and business teams.<br>• Collaborate with data scientists to provide infrastructure and tools for complex analytical models, using programming languages like Python or R.<br>• Implement and enforce data governance policies, including encryption, masking, and access controls, within cloud environments.<br>• Monitor and troubleshoot data pipelines for performance issues, applying tuning techniques to enhance throughput and reliability.<br>• Stay updated with emerging technologies in data engineering and advocate for improvements to the organization's data systems.
<p>Position Overview</p><p>We are seeking a Data Governance & Data Quality Platform Engineer to own the technical administration, integration, and optimization of enterprise data governance and data quality platforms (e.g., Atlan, Monte Carlo). This role ensures governance and quality tools are scalable, securely integrated into the enterprise data ecosystem, and maintained for high availability and performance.</p><p>The ideal candidate brings strong platform engineering skills, experience automating data quality and metadata workflows, and a solid understanding of governance, compliance, and modern data architectures.</p><p>Key Responsibilities</p><p><br></p><p>1. Platform Engineering & Administration</p><ul><li>Configure and maintain data governance platforms for metadata management, data lineage, and governance workflows</li><li>Configure data quality tools for profiling, rule creation, and monitoring dashboards</li><li>Manage platform security, including user roles, authentication, SSO, RBAC, and access controls</li></ul><p>e2. Integration & Automation</p><ul><li>Develop and maintain integrations across data sources, databases, data lakes, and BI tools</li><li>Automate metadata ingestion and data quality checks using APIs, Python scripts, or ETL frameworks</li><li>Configure and maintain connectors for analytics and reporting platforms</li></ul><p> 3. Performance, Reliability & Monitoring</p><ul><li>Monitor platform health and optimize performance and scalability</li><li>Apply upgrades, patches, and troubleshoot technical issues</li><li>Implement logging, alerting, and proactive monitoring for governance and data quality environments</li></ul><p>a4. Technical Support & Issue Resolution</p><ul><li>Provide Tier 3 support for platform‑related incidents and escalations</li><li>Debug integration failures and resolve configuration conflicts</li><li>Collaborate with vendors for advanced troubleshooting and roadmap alignment</li></ul><p>r5. Security, Compliance & Risk Management</p><ul><li>Ensure platforms comply with data privacy and security standards (e.g., GDPR, CCPA)</li><li>Implement encryption, audit logging, and access controls</li><li>Support compliance reporting and risk assessments using governance and data quality metrics</li></ul>
We are looking for an experienced Data Engineer to join our team in Houston, Texas. In this role, you will design, implement, and optimize data systems that support critical business operations. The ideal candidate will have a strong technical background and a passion for creating efficient, scalable solutions.<br><br>Responsibilities:<br>• Develop and maintain scalable data pipelines to ensure efficient processing of large datasets.<br>• Design and implement data architectures that support high-performance data integration and analysis.<br>• Collaborate with cross-functional teams to gather requirements and deliver tailored data solutions.<br>• Build and manage ETL workflows to support data transformation and integration processes.<br>• Optimize data storage and processing techniques using tools such as Apache Hadoop and Apache Spark.<br>• Implement real-time data streaming solutions using Apache Kafka.<br>• Troubleshoot and resolve issues within the data infrastructure to maintain system reliability.<br>• Monitor system performance and suggest improvements to enhance data processing efficiency.<br>• Document processes and workflows to ensure clarity and consistency in data operations.<br>• Stay current with emerging technologies and industry trends to continually improve data engineering practices.
<p>We are seeking a skilled <strong>Azure Data Engineer</strong> to design, build, and maintain scalable data solutions on the Microsoft Azure platform. The ideal candidate will have strong experience developing data pipelines, optimizing data architectures, and supporting analytics and business intelligence initiatives. This role will work closely with data analysts, data scientists, and business stakeholders to ensure reliable, high-quality data is available for reporting and advanced analytics.</p><p><br></p><p>Key Responsibilities</p><ul><li>Design, develop, and maintain <strong>scalable data pipelines and ETL/ELT processes</strong> using Azure data services.</li><li>Build and manage data solutions using tools such as <strong>Azure Data Factory, Azure Synapse Analytics, Azure Data Lake, and Azure Databricks</strong>.</li><li>Develop and optimize <strong>data models, transformations, and storage strategies</strong> for large-scale structured and unstructured datasets.</li><li>Ensure <strong>data quality, integrity, and security</strong> across the data platform.</li><li>Monitor and troubleshoot data workflows, pipeline failures, and performance issues.</li><li>Collaborate with data analysts, BI developers, and data scientists to deliver reliable datasets for reporting and analytics.</li><li>Implement <strong>data governance and best practices</strong> for data management and documentation.</li><li>Automate data processes and deployments using <strong>CI/CD pipelines and infrastructure-as-code practices</strong>.</li><li>Optimize cost and performance of Azure data services.</li><li>Stay current with new Azure features, tools, and industry best practices.</li></ul><p><br></p>
<p>Position Overview</p><p>We are seeking a talented <strong>Data Engineer</strong> with strong experience in <strong>Python, AWS, and Databricks</strong> to design and build scalable data pipelines and modern data platforms. The ideal candidate will help develop and maintain data infrastructure that supports analytics, machine learning, and business intelligence initiatives. This role requires hands-on experience working with large datasets, cloud-native architectures, and distributed data processing frameworks.</p><p><br></p><p>Key Responsibilities</p><ul><li>Design, build, and maintain <strong>scalable data pipelines and ETL/ELT workflows</strong> using Python and cloud technologies.</li><li>Develop and optimize data solutions using <strong>AWS services and Databricks</strong>.</li><li>Build and manage <strong>data lakes and data warehouses</strong> for structured and unstructured data.</li><li>Implement <strong>data transformation and processing pipelines</strong> using Apache Spark within Databricks.</li><li>Integrate data from multiple sources including APIs, databases, and streaming systems.</li><li>Ensure <strong>data quality, governance, security, and compliance</strong> across the data platform.</li><li>Monitor pipeline performance and troubleshoot <strong>data pipeline failures or latency issues</strong>.</li><li>Collaborate with <strong>data analysts, data scientists, and business stakeholders</strong> to deliver reliable datasets.</li><li>Optimize storage and compute costs within the AWS ecosystem.</li><li><br></li></ul><p><br></p>
<p>Architect and deliver modern data platform solutions with a strong emphasis on Databricks and contemporary cloud data technologies.</p><p>Build secure, scalable, and high‑performing data environments that enable analytics, reporting, and enterprise‑wide data initiatives.</p><p>Oversee and execute migrations from legacy relational databases into Databricks-based ecosystems.</p><p>Design and structure scalable data pipelines and foundational data infrastructure aligned with organizational goals.</p><p>Create and maintain ETL/ELT processes within Databricks to ensure efficient ingestion, transformation, and delivery of data.</p><p>Continuously refine and optimize data workflows to improve performance, stability, and data quality across all processes.</p><p>Manage end-to-end data transitions to ensure operational continuity with minimal business disruption.</p><p>Monitor Databricks workloads and optimize performance, scalability, and cost efficiency across compute and storage layers.</p><p>Partner with data engineers, scientists, analysts, and product stakeholders to gather requirements and build fit‑for‑purpose data solutions.</p><p>Establish and enforce data engineering best practices, development standards, and architectural guidelines.</p><p>Assess emerging tools and technologies to enhance pipeline efficiency, reliability, and automation capabilities.</p><p>Provide technical direction, guidance, and mentorship to junior engineers and team members.</p><p>Collaborate closely with DevOps and infrastructure teams to deploy, manage, and support data systems in production.</p><p>Ensure all data solutions meet compliance standards, organizational security policies, and regulatory obligations.</p><p>Work with enterprise architects and IT leadership to align data architecture with broader technology strategies and long-term roadmaps</p>
<p><strong>Principal Data Scientist (AI/ML Focus)</strong></p><p><strong>Service Type:</strong> 42 Week Contract </p><p><strong>Worksite:</strong> Onsite, Monday–Thursday — Houston, TX</p><p><strong>Pay: </strong>Available on W2 </p><p><strong>Position Overview</strong></p><p>We are seeking a <strong>Principal Scientist, Data</strong> with deep expertise in <strong>AI, Machine Learning, Natural Language Processing (NLP), Computer Vision (CV), and Generative AI</strong>. This role requires a strong technical foundation, excellent communication skills, and the ability to translate complex methodologies into meaningful business outcomes.</p><p>The ideal candidate is proactive, innovative, and passionate about developing advanced AI-driven solutions using modern architectures including <strong>LLMs, deep learning models, multi-agent systems, and generative AI techniques</strong>.</p><p><strong>Requirements</strong></p><ul><li>Strong background in <strong>NLP, Computer Vision, and Generative AI</strong>.</li><li>Broad background in <strong>Artificial Intelligence</strong>.</li><li>Excellent verbal and written communication skills.</li></ul><p> <strong>Key Responsibilities</strong></p><ul><li>Develop, train, and optimize <strong>machine learning and deep learning models</strong>.</li><li>Build advanced AI solutions using <strong>LLMs, multi-agent systems, fine-tuning techniques, and inference optimization</strong>.</li><li>Transform complex data science methodologies into actionable insights.</li><li>Collaborate closely with stakeholders to develop high-value, data-driven solutions.</li><li>Create clear, compelling presentations, dashboards, and deliverables for non-technical audiences.</li><li>Drive full lifecycle AI/ML projects from ideation through deployment.</li></ul>
We are looking for a skilled Data Engineer with expertise in AI/ML technologies and prior experience in the oil and gas industry to join our team in Houston, Texas. In this Contract to permanent position, you will play a key role in transforming data into actionable insights through advanced analytics and innovative solutions. This opportunity is ideal for professionals who thrive in data-driven environments and excel at leveraging tools like Power BI and PowerApps.<br><br>Responsibilities:<br>• Develop and manage Power BI dashboards and reports to deliver meaningful insights from raw data.<br>• Utilize PowerApps to create and maintain applications that support business intelligence initiatives.<br>• Collaborate with cross-functional teams to understand data requirements and implement solutions.<br>• Analyze complex datasets to identify trends and patterns that inform decision-making.<br>• Ensure the accuracy, reliability, and security of data within BI systems.<br>• Optimize data pipelines and workflows for improved performance and scalability.<br>• Provide technical expertise to support AI/ML integration into existing data processes.<br>• Stay updated on emerging technologies and best practices in data engineering and AI/ML.<br>• Troubleshoot and resolve issues related to data tools and processes.<br>• Document processes, workflows, and methodologies for future reference.