Security

Why Your NDA Doesn't Protect You From Translation Tool Data Leaks

NDAs govern people, not software pipelines. If your translation tool trains on your text or logs it, your NDA with the counterparty is irrelevant. Here's the gap analysis.

Y
Yash Khare·LinkedIn··7 min read
Why Your NDA Doesn't Protect You From Translation Tool Data Leaks

I had a conversation with a startup founder last month that stuck with me.

she was about to close a funding round. her lawyer sent over the term sheet in German. she needed to understand it before the call. so she pasted the whole thing into a free translation tool.

when I asked about confidentiality, she said: "it's fine, we have an NDA with the investor."

and technically, she's right. the NDA covers the investor. it covers her team. it covers the lawyer.

it does not cover Google.

The false comfort of 'they signed an NDA'

NDAs are contracts between parties. they say: you and I agree not to share this information with third parties.

the key word is parties. an NDA binds the humans (and companies) who sign it. it doesn't bind the software those humans use to process the information.

when you paste a confidential term sheet into a translation tool, you haven't asked the translation tool to sign an NDA. you've voluntarily disclosed the information to a third party — one whose terms of service may include the right to use your input for model training, analytics, or service improvement.

the NDA protects you from the other side leaking. it does nothing about your side leaking through tooling.

and here's the uncomfortable part: the other side might have a legitimate breach claim against you. "you disclosed our confidential term sheet to a data processor without our consent" is a real argument. and it's a strong one.

What NDAs actually cover vs what translation tools do

let's be specific about the gap.

What your NDA typically covers

  • the parties can't share the information with unauthorized third parties
  • the information must be treated as confidential
  • there are usually exceptions for "professional advisors" and "agents"
  • there's a defined scope (what counts as confidential) and a time period

What translation tools typically do

  • training: some tools use your text to improve their models. your confidential information becomes part of a shared dataset used to generate outputs for other users
  • logging: some tools log your full translation request for debugging, analytics, or abuse detection. your text sits in a database, accessible to the vendor's employees and potentially their subprocessors
  • retention: some tools store your translation indefinitely. even tools that claim short retention may keep content in backups, caches, or log files
  • subprocessing: your text often passes through multiple systems — the vendor's frontend, their API, their cloud provider, their translation engine. each hop is another entity with access

here's the thing: none of these entities signed your NDA.

3 real scenarios where NDAs fail to protect

these aren't hypothetical. they're patterns I've seen or heard about from legal teams.

Scenario 1: the M&A leak

a company is in acquisition talks. the target sends due diligence documents in French. the buyer's finance team pastes them into a free translation tool to speed up review.

the tool trains on the data. six months later, a competitor's employee uses the same tool and gets suspiciously relevant autocomplete suggestions.

the NDA between buyer and target? completely irrelevant. the buyer disclosed the information to a third party (the translation tool) without authorization. the NDA breach is on the buyer's side.

Scenario 2: the cross-border HR dispute

a multinational company translates employee performance reviews from German to English using a tool that logs translation content.

an employee files a GDPR subject access request. the company has to disclose all personal data it holds — including data held by processors. they now have to explain why employee performance reviews were shared with a translation vendor, whether a DPA was in place, and whether the employee consented.

the NDA between the company and the employee? doesn't cover this. this is a GDPR processing issue, and the NDA doesn't substitute for proper data processing governance.

Scenario 3: the startup term sheet

a founder pastes a term sheet into ChatGPT to understand the liquidation preferences. the investor's law firm later discovers this during a routine due diligence check on data hygiene.

the investor pulls the deal. not because the information was actually leaked — but because the founder demonstrated a pattern of careless data handling that raises questions about how they'll handle customer data, IP, and trade secrets.

the NDA was in place. the information was technically shared with OpenAI's servers. the trust is broken.

What actually protects you

if NDAs aren't enough, what is? the answer is boring but correct: technical controls.

Data Processing Agreements (DPAs)

a DPA is a legally binding contract between you (the data controller) and the translation vendor (the data processor). unlike an NDA, a DPA specifically governs how the vendor handles your data — what they can do with it, how long they keep it, and what happens when you want it deleted.

this is what GDPR Article 28 requires. if you're processing personal data through a translation tool and you don't have a DPA, you're already non-compliant.

Zero-retention architecture

the most effective protection against translation data leaks is... not having the data to leak. tools with zero-retention architecture delete your content after a defined window (e.g., 30 minutes). there's no training set to leak into, no log to breach, no backup to subpoena.

this isn't a policy. it's an architectural constraint. the data doesn't exist, so it can't be disclosed.

No-training guarantees

separate from retention, you need a contractual commitment that your content will never be used for model training. this should be in the DPA, not just on a marketing page.

"we don't train on your data" on a website is a statement. "we don't train on your data" in a DPA is a legally enforceable commitment.

Vendor evaluation

for any tool touching confidential information, run it through the same vendor evaluation checklist you'd use for any data processor. training policy. retention window. residency. access controls. subprocessors.

RequirementWhy it matters
Signed DPALegally binds the vendor to data handling commitments
No-training commitmentPrevents content from entering model training pipelines
Defined retention windowEnsures content is deleted within a verifiable timeframe
EU data residencyKeeps processing within your regulatory jurisdiction
Subprocessor transparencyYou know exactly who touches your data
No content loggingPrevents your text from sitting in debug or analytics logs
Incident response processWhat happens if there's a breach at the vendor level

if a vendor can't provide all of these, they're not ready for confidential work. use them for marketing copy and internal memos. use something else for the sensitive stuff.

What to tell your team

the practical takeaway for most organizations is a simple rule:

if it's covered by an NDA, it needs a DPA-grade translation tool.

print that on a poster. put it in the security training. make it part of the onboarding checklist.

the NDA tells you the information is important. the DPA tells you the tool is safe to process it.

one without the other is a gap. and gaps are where breaches happen.

Takeaways

  • NDAs protect you from the other party leaking — they don't protect you from your own tooling leaking
  • pasting NDA-covered content into a free translation tool may constitute unauthorized third-party disclosure
  • the gap between "we signed an NDA" and "our tools are compliant" is where real risk lives
  • DPAs, zero-retention architecture, and no-training guarantees are what actually protect confidential content
  • if the information is important enough to need an NDA, it's important enough to need a secure translation path

Further reading

Tags

securityprivacysensitive-documentsconfidentialcompliancelegal

Related Articles

Try noll for free

Translate your sensitive documents with zero data retention. Your files are automatically deleted after download.

Get started for free

Browse by Topic

All posts
Why Your NDA Doesn't Protect You From Translation Tool Data Leaks | noll.to | www.noll.to