Skip to main content
H

NLP Data Engineer

Harnham

Location

Amsterdam, North Holland, Netherlands

Salary

Not specified

Type

fulltime

Posted

Today

via linkedin

Job Description

NLP Data Engineer

Amsterdam (3 days in office)

Up to €90k \+ benefits

Join a highly respected company operating across, working on the most heavily regulated sectors, including financial services. The organisation is committed to modernising legal practice through advanced data capabilities - transforming how legal professionals access insights, manage information, and solve complex client challenges.

Over the past two years, significant investment has gone into building a mature data function across BI, Data Engineering, Data Governance, and Integrations. Now the focus shifts to

solving deeper, more complex data problems

, particularly involving the conversion of

unstructured legal documents into high‑quality structured datasets

.

This is a role for someone who enjoys innovation, complexity, and impact.

The Team

You'll join the Data Engineering team and report into the

Head of Data

. The team works closely with Legal, Risk, Compliance, and client-facing service teams on both internal solutions and external client projects.

The Role

In this role, you will:

🔹 Work on complex internal and external data projects

  • Ingest, process, and structure large volumes of

unstructured legal and financial documents

.

  • Build data pipelines that transform text-heavy content into usable, analysable datasets.
  • Support client engagements by designing data solutions that help legal teams advise faster and with greater accuracy.

🔹 Apply advanced engineering to difficult data challenges

  • Tackle complex data modelling, parsing, extraction, and classification issues.
  • Implement NLP techniques (NER, sentiment, semantic search, vectorisation) to enhance document understanding.
  • Build robust pipelines for document ingestion, OCR, and data extraction.

🔹 Use modern data technologies

  • Leverage the Azure data platform for scalable ingestion and processing.
  • Work in Databricks on advanced text analytics, pattern detection, and transformation logic.
  • Support integration with low-code tools like Microsoft Fabric or TimeXtender where relevant.

🔹 Contribute to a maturing data ecosystem

  • Help shape best practices across the engineering team.
  • Partner with Data Governance, BI, and Integration teams to improve data quality and accessibility.
  • Guide adoption of new tools and approaches that move the firm toward smarter, AI-enabled legal services.

Your Skills and Experience

We’re looking for someone who brings:

  • Strong experience handling

unstructured data

, especially text-heavy documents.

  • Solid understanding of

data management, data governance, and regulatory constraints

.

  • Proven skills in designing and building scalable data pipelines in the

Azure ecosystem

.

  • Expertise with Databricks (PySpark/Scala) for heavy data processing and NLP workloads.
  • Experience with NLP techniques and frameworks (spaCy, Hugging Face, MLflow integration) is highly beneficial.
  • Ability to solve diverse, ambiguous data challenges across different business contexts.
  • Experience with TimeXtender or Fabric is an advantage, but not essential.

What They Offer

  • Hybrid, flexible working arrangements.
  • A role with real impact: your work enables legal teams to serve clients more efficiently and accurately.
  • Opportunities to work on a diverse mix of internal innovation projects and external client assignments.
  • A highly collaborative, mature data organisation backed by strong leadership and investment.

How to Apply

If you enjoy solving complex data problems and want to shape how a leading law firm uses data and AI to transform its practice, please apply with your CV.

Looking for more opportunities?

Browse thousands of graduate jobs and entry-level positions.

Browse All Jobs