NLP Data Engineer

Amsterdam (3 days in office)

Up to €90k \+ benefits

Join a highly respected company operating across, working on the most heavily regulated sectors, including financial services. The organisation is committed to modernising legal practice through advanced data capabilities - transforming how legal professionals access insights, manage information, and solve complex client challenges.

Over the past two years, significant investment has gone into building a mature data function across BI, Data Engineering, Data Governance, and Integrations. Now the focus shifts to

solving deeper, more complex data problems

, particularly involving the conversion of

unstructured legal documents into high‑quality structured datasets

This is a role for someone who enjoys innovation, complexity, and impact.

The Team

You'll join the Data Engineering team and report into the

Head of Data

. The team works closely with Legal, Risk, Compliance, and client-facing service teams on both internal solutions and external client projects.

The Role

In this role, you will:

🔹 Work on complex internal and external data projects

Ingest, process, and structure large volumes of

unstructured legal and financial documents

Build data pipelines that transform text-heavy content into usable, analysable datasets.

Support client engagements by designing data solutions that help legal teams advise faster and with greater accuracy.

🔹 Apply advanced engineering to difficult data challenges

Tackle complex data modelling, parsing, extraction, and classification issues.

Implement NLP techniques (NER, sentiment, semantic search, vectorisation) to enhance document understanding.

Build robust pipelines for document ingestion, OCR, and data extraction.

🔹 Use modern data technologies

Leverage the Azure data platform for scalable ingestion and processing.

Work in Databricks on advanced text analytics, pattern detection, and transformation logic.

Support integration with low-code tools like Microsoft Fabric or TimeXtender where relevant.

🔹 Contribute to a maturing data ecosystem

Help shape best practices across the engineering team.

Partner with Data Governance, BI, and Integration teams to improve data quality and accessibility.

Guide adoption of new tools and approaches that move the firm toward smarter, AI-enabled legal services.

Your Skills and Experience

We’re looking for someone who brings:

Strong experience handling

unstructured data

, especially text-heavy documents.

Solid understanding of

data management, data governance, and regulatory constraints

Proven skills in designing and building scalable data pipelines in the

Azure ecosystem

Expertise with Databricks (PySpark/Scala) for heavy data processing and NLP workloads.

Experience with NLP techniques and frameworks (spaCy, Hugging Face, MLflow integration) is highly beneficial.

Ability to solve diverse, ambiguous data challenges across different business contexts.

Experience with TimeXtender or Fabric is an advantage, but not essential.

What They Offer

Hybrid, flexible working arrangements.

A role with real impact: your work enables legal teams to serve clients more efficiently and accurately.

Opportunities to work on a diverse mix of internal innovation projects and external client assignments.

A highly collaborative, mature data organisation backed by strong leadership and investment.

How to Apply

If you enjoy solving complex data problems and want to shape how a leading law firm uses data and AI to transform its practice, please apply with your CV.

NLP Data Engineer

Job Description

Looking for more opportunities?