Near State-of-the-Art NLP
ID: 103

Japanese Contract Analysis SaaS

NLP-powered SaaS platform for Japanese contract analysis achieving near state-of-the-art performance, later expanded to English contracts using deep learning.

Python, NLP, Deep Learning, TypeScript

The Challenge

Japanese contract analysis required deep linguistic understanding of complex legal terminology, sentence structures, and implicit obligations, and very little training data was available for legal NLP tasks.

The Solution

Combined deep learning models with domain-specific rule-based processing. Collaborated closely with legal experts for data annotation, and iterated rapidly on clause risk detection and contract information extraction models.
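One way to picture the hybrid approach is a small function that merges a model score with rule hits. The sketch below is illustrative only: `combine_verdict`, `ClauseVerdict`, the 0.5 threshold, and the sample rule are hypothetical names and values, and the model is passed in as a plain callable rather than a real network.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class ClauseVerdict:
    risky: bool
    score: float
    reasons: List[str]


def unlimited_liability_rule(clause: str) -> Optional[str]:
    """Hypothetical rule: flag wording that implies liability for 'all damages'."""
    return "unlimited liability wording" if "一切の損害" in clause else None


def combine_verdict(
    clause: str,
    model_score: Callable[[str], float],
    rules: List[Callable[[str], Optional[str]]],
    threshold: float = 0.5,  # assumed cut-off, not taken from the project
) -> ClauseVerdict:
    """Mark a clause risky if the model score is high OR any rule fires."""
    score = model_score(clause)
    reasons = []
    for rule in rules:
        reason = rule(clause)
        if reason is not None:
            reasons.append(reason)
    return ClauseVerdict(risky=score >= threshold or bool(reasons),
                         score=score, reasons=reasons)


if __name__ == "__main__":
    # A stand-in scorer that always returns a low probability.
    verdict = combine_verdict("乙は一切の損害を賠償する。",
                              lambda text: 0.2,
                              [unlimited_liability_rule])
    print(verdict)
```

Letting a rule override a low model score is one possible design choice; it favors recall on patterns that legal experts consider non-negotiable, at the cost of more false positives.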

System Architecture

Deep Learning Models

Transformer-based models fine-tuned for Japanese legal NLP tasks including clause classification and risk detection.

Rule-Based Pipeline

Domain-specific processing rules developed with legal experts to handle edge cases and legal conventions.
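A rule-based pass of this kind can be sketched as a table of named regular expressions applied clause by clause. The patterns and rule names below are assumptions made up for illustration; they are not the rules developed with the legal experts, and real rules would need far more care around Japanese legal phrasing.

```python
import re
from typing import List, NamedTuple, Tuple


class Rule(NamedTuple):
    name: str
    pattern: re.Pattern


# Hypothetical patterns for three common contract conventions.
RULES = [
    Rule("auto_renewal", re.compile(r"自動的?に.{0,10}更新")),
    Rule("unlimited_damages", re.compile(r"一切の損害")),
    Rule("unilateral_termination", re.compile(r"いつでも.{0,20}解除")),
]


def split_clauses(text: str) -> List[str]:
    """Naive sentence split on the Japanese full stop."""
    return [part.strip() for part in text.split("。") if part.strip()]


def flag_clauses(text: str) -> List[Tuple[str, List[str]]]:
    """Return (clause, matching rule names) for every clause that triggers a rule."""
    flagged = []
    for clause in split_clauses(text):
        hits = [rule.name for rule in RULES if rule.pattern.search(clause)]
        if hits:
            flagged.append((clause, hits))
    return flagged


if __name__ == "__main__":
    sample = "本契約は自動的に更新される。乙は甲に生じた一切の損害を賠償する。"
    for clause, hits in flag_clauses(sample):
        print(hits, clause)
```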

ML Lifecycle

End-to-end pipeline covering data annotation, model training, evaluation, and production deployment.
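The evaluation step of such a pipeline often comes down to comparing predicted clause labels against expert annotations. The following is a minimal sketch of per-label and macro-averaged F1; the label names and the choice of macro-F1 as the metric are assumptions, since the source does not say which metrics were used.

```python
from collections import Counter
from typing import Dict, List


def per_label_f1(gold: List[str], pred: List[str]) -> Dict[str, float]:
    """Compute F1 for each label from parallel lists of gold and predicted labels."""
    if len(gold) != len(pred) or not gold:
        raise ValueError("gold and pred must be non-empty and the same length")
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1  # predicted label was wrong
            fn[g] += 1  # gold label was missed
    scores = {}
    for label in set(gold) | set(pred):
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores[label] = 2 * tp[label] / denom if denom else 0.0
    return scores


def macro_f1(gold: List[str], pred: List[str]) -> float:
    """Unweighted mean of per-label F1, so rare clause types count equally."""
    scores = per_label_f1(gold, pred)
    return sum(scores.values()) / len(scores)


if __name__ == "__main__":
    gold = ["risk", "ok", "risk", "ok"]
    pred = ["risk", "risk", "risk", "ok"]
    print(per_label_f1(gold, pred))
    print(macro_f1(gold, pred))
```

Macro averaging is shown here because clause types in contracts tend to be imbalanced; a micro average or per-class report would be equally reasonable.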

Key Outcomes

Achieved near state-of-the-art performance in Japanese contract clause analysis.

Owned full ML lifecycle from data annotation through deployment.

Improved clause risk detection and contract information extraction.

Spearheaded expansion to English contract analysis using deep learning.

Tech Foundation

NLP & ML
Python, PyTorch, Transformers, NLP
Data
Data Annotation, ML Pipelines, Evaluation
Backend
FastAPI, PostgreSQL, Docker