We're hiring a senior software engineer to help build the largest case law dataset. Our data coverage includes US laws and court decisions and powers our lawyer-facing AI platform and B2B data services.
Responsibilities include:
Building pipelines that augment documents with metadata, e.g., which decisions overrule another decision, which decisions are an appeal/remand/consolidation of another decision, etc. Our competitors still label these by humans making $300+/hr.
Building systems to ensure the reliability and accuracy of hundreds of web scrapers.
Optimizing and evaluating our core utils, which do things like extracting and resolving citations, determining which courts are able to overrule which other courts, etc.
Exposing core services on our data via APIs, MCPs, websockets.
Benchmarking and evaluation of core tasks (human and synthetic).
We believe in skipping what can be skipped and appreciate simple solutions to complex problems.
Good candidates for this role should be (1) technical generalists, definitely across the backend (bonus for fullstack), and (2) comfortable working with data pipelines, including basic to intermediate infra/devops.
Interest/experience with stats/ML/AI is a bonus, but not critical. You should be cautiously AI-pilled.
Tech stack isn't critical, Python and SQL are core. Definitely be able to stand up your own projects on your preferred infra end-to-end.
This is a remote role. Additional compensation offered for relocation to NYC.
Skills: Python, PostgreSQL, ElasticSearch, Playwright, GCP, Pinecone, Prefect, NeonDB.
Visa sponsorship is not available.
Similar Jobs
What you need to know about the San Francisco Tech Scene
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

.png)