Added: 2026-03-10 12:35.41
Updated: 2026-03-26 03:05.11

Data Engineer

Madrid, Spain

Type: Cognitive AI

Category: Data Engineer

Skills needed: SQL, Python, PySpark, Pandas, Apache Spark.
Employer: ThetaRay

ThetaRay is a trailblazer in AI-powered Anti-Money Laundering (AML) solutions, offering cutting-edge technology to fintechs, banks, and regulatory bodies worldwide. Our mission is to enhance trust in financial transactions, ensuring compliant and innovative business growth. Our technology empowers customers to expand into new markets and introduce groundbreaking products.

Why Join ThetaRay?

At ThetaRay, you'll be part of a dynamic global team committed to redefining the financial services sector through technological innovation. You will contribute to creating safer financial environments and have the opportunity to work with some of the brightest minds in AI, ML, and financial technology. We offer a collaborative, inclusive, and forward-thinking work environment where your ideas and contributions are valued and encouraged.

Join us in our mission to revolutionize the financial world, making it safer and more trustworthy for millions worldwide. Explore exciting career opportunities at ThetaRay – where innovation meets purpose.

We are looking for a Data Engineer to join our growing team of data experts. As a Data Engineer, you will be responsible for designing, implementing, and optimizing data pipeline flows within the ThetaRay system. You will support our data scientists with the implementation of the relevant data flows based on the data scientist’s features design and construct complex rules to detect money laundering activity.

The ideal candidate has experience in building data pipelines and data transformations and enjoys optimizing data flows and building them from the ground up. They must be self-directed and comfortable supporting multiple production implementations for various use cases.

Responsibilities

- Implement and maintain data pipeline flows in production within the ThetaRay system based on the data scientist’s design
- Design and implement solution-based data flows for specific use cases, enabling the applicability of implementations within the ThetaRay product
- Build a Machine Learning data pipeline
- Create data tools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader
- Work with product, R&D, data, and analytics experts to strive for greater functionality in our systems
- Train customer data scientists and engineers to maintain and amend data pipelines within the product
- Travel to customer locations both domestically and abroad
- Build and manage technical relationships with customers and partners

Requirements

- 2+ years of hands-on experience working with Apache Spark (must)
- Hands-on experience with SQL
- Hands-on experience with version-control tools such as Git
- Hands-on experience with the Apache Hadoop ecosystem, including Hive, Impala, Hue, HDFS, Sqoop, etc.
- Experience with Python (Pandas)
- Experience with PySpark/Scala/Java/R
- Hands-on experience with data transformation, validations, cleansing, and ML feature engineering
- BSc degree or higher in Computer Science, Statistics, Informatics, Information Systems, Engineering, or another quantitative field
- Experience working with and optimizing big data pipelines, architectures, and data sets (an advantage)
- Strong analytic skills related to working with structured and semi-structured datasets
- Experience building processes that support data transformation, data structures, metadata, dependency, and workload management
- Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement
- Business-oriented and able to work with external customers and cross-functional teams
- Fluent in English and Spanish, both written and spoken

Nice to have

- Experience with Linux
- Experience in building Machine Learning pipelines
- Experience with Elasticsearch
- Experience with Zeppelin/Jupyter
- Experience with workflow automation platforms such as Jenkins or Apache Airflow
- Experience with Microservices architecture components, including Docker and Kubernetes