Location
Amsterdam, North Holland, Netherlands
Salary
Not specified
Type
fulltime
Posted
Today
Job Description
NLP Data Engineer
Amsterdam (3 days in office)
Up to €90k \+ benefits
Join a highly respected company operating across, working on the most heavily regulated sectors, including financial services. The organisation is committed to modernising legal practice through advanced data capabilities - transforming how legal professionals access insights, manage information, and solve complex client challenges.
Over the past two years, significant investment has gone into building a mature data function across BI, Data Engineering, Data Governance, and Integrations. Now the focus shifts to
solving deeper, more complex data problems
, particularly involving the conversion of
unstructured legal documents into high‑quality structured datasets
.
This is a role for someone who enjoys innovation, complexity, and impact.
The Team
You'll join the Data Engineering team and report into the
Head of Data
. The team works closely with Legal, Risk, Compliance, and client-facing service teams on both internal solutions and external client projects.
The Role
In this role, you will:
🔹 Work on complex internal and external data projects
- Ingest, process, and structure large volumes of
unstructured legal and financial documents
.
- Build data pipelines that transform text-heavy content into usable, analysable datasets.
- Support client engagements by designing data solutions that help legal teams advise faster and with greater accuracy.
🔹 Apply advanced engineering to difficult data challenges
- Tackle complex data modelling, parsing, extraction, and classification issues.
- Implement NLP techniques (NER, sentiment, semantic search, vectorisation) to enhance document understanding.
- Build robust pipelines for document ingestion, OCR, and data extraction.
🔹 Use modern data technologies
- Leverage the Azure data platform for scalable ingestion and processing.
- Work in Databricks on advanced text analytics, pattern detection, and transformation logic.
- Support integration with low-code tools like Microsoft Fabric or TimeXtender where relevant.
🔹 Contribute to a maturing data ecosystem
- Help shape best practices across the engineering team.
- Partner with Data Governance, BI, and Integration teams to improve data quality and accessibility.
- Guide adoption of new tools and approaches that move the firm toward smarter, AI-enabled legal services.
Your Skills and Experience
We’re looking for someone who brings:
- Strong experience handling
unstructured data
, especially text-heavy documents.
- Solid understanding of
data management, data governance, and regulatory constraints
.
- Proven skills in designing and building scalable data pipelines in the
Azure ecosystem
.
- Expertise with Databricks (PySpark/Scala) for heavy data processing and NLP workloads.
- Experience with NLP techniques and frameworks (spaCy, Hugging Face, MLflow integration) is highly beneficial.
- Ability to solve diverse, ambiguous data challenges across different business contexts.
- Experience with TimeXtender or Fabric is an advantage, but not essential.
What They Offer
- Hybrid, flexible working arrangements.
- A role with real impact: your work enables legal teams to serve clients more efficiently and accurately.
- Opportunities to work on a diverse mix of internal innovation projects and external client assignments.
- A highly collaborative, mature data organisation backed by strong leadership and investment.
How to Apply
If you enjoy solving complex data problems and want to shape how a leading law firm uses data and AI to transform its practice, please apply with your CV.
Looking for more opportunities?
Browse thousands of graduate jobs and entry-level positions.