About the Role
We're hiring a senior software/data engineer to help build the largest case law dataset. Our data covers US laws and court decisions and powers our lawyer-facing AI platform and B2B data services.
Responsibilities
- Building pipelines that augment documents with metadata, e.g., which decisions overrule another decision and which are an appeal, remand, or consolidation of another decision.
- Building systems to ensure the reliability and accuracy of hundreds of web scrapers.
- Optimizing and evaluating our core utilities, which handle tasks such as extracting and resolving citations and determining which courts can overrule which other courts.
- Exposing core services on our data via APIs, MCP servers, and WebSockets.
- Benchmarking and evaluating core tasks (human and synthetic).
What We Are Looking For
We believe in skipping what can be skipped and appreciate simple solutions to complex problems. Good candidates for this role are technical generalists who are comfortable working across the backend (full-stack is a bonus) and capable of handling data pipelines, including basic to intermediate infra/DevOps. Interest in or experience with stats, ML, or AI is a bonus. You should be able to stand up your own projects end-to-end on your preferred infrastructure.