GENERAL AI TRAINING AND FINE-TUNING DATA LICENSE

AI TRAINING AND FINE TUNING DATA LICENSE – GENERAL USE, VERSION 1.0

Version 1.0

For internal AI training, evaluation, and research; not for commercial deployment or monetisation.

Platform-Specific License Notice: > This License is issued by Opendatabay (www.opendatabay.com) to support both Data Suppliers and Data Buyers. Its use outside the Opendatabay platform, or without Opendatabay’s knowledge and authorization, is strictly prohibited. This License applies to Data Products listed on Opendatabay or provided via the platform, including samples, previews, or full datasets.

1. Definitions

"Data Product": The collection of bundled and packaged data being licensed, which may include text, audio, images, video, raw or structured data, as described in the accompanying documentation, together with any provenance, transparency, or compliance information provided by the Licensor to support AI training, evaluation, and model documentation (e.g., model cards or safety assessments).

"Licensee": The individual or entity receiving rights to use the Data Product under this License.

"Licensor / Data Provider": The individual or entity that collected, prepared, and owns the Data Product and is granting this License.

"AI / ML / LLM": Artificial Intelligence, Machine Learning, and Large Language Models, including neural networks or other model architectures trained or fine-tuned using the Data Product.

"Models": Machine learning models created, trained, or fine-tuned using the Data Product.

"Derivative Works": Outputs of Models, including model weights, embeddings, fine-tuned versions, or other representations derived from the Data Product.

"Outputs": Model predictions, generated text, audio, embeddings, or other results produced using the Data Product.

"Permitted Uses": Activities expressly allowed under this License, including Training, Fine-Tuning, Evaluation, and Research purposes. Commercial deployment, external SaaS offerings, or monetisation of Outputs are strictly prohibited under this General Use license.

"Order": The purchase agreement, invoice, or transaction record that specifies the Data Product being licensed, pricing, payment terms, Effective Date, and contact information for both parties.

"Effective Date": The date specified in the Order when this License becomes effective, typically the date of purchase or payment confirmation.

2. License Grant and Scope (Permitted Uses)

2.1 License Grant

The Licensor grants the Licensee a non-exclusive, worldwide, perpetual (subject to termination rights in Section 10) right to use this Data Product solely for internal research, evaluation, and educational purposes.

The Licensee may:

  • Train or fine-tune models, including LLMs and AI agents, AI applications and systems for internal use only.

  • Create and use model outputs, embeddings, weights, or other Derivative Works strictly for non-commercial purposes.

  • Evaluate models and publish research or benchmarks without redistributing the Data Product.

Commercial deployment, use in production API services, or any form of monetisation is NOT permitted.

2.2 Licensor Representations

The Licensor represents and warrants that:

  • It has the rights and authority necessary to license the Data Product for the Permitted Uses.

  • The Data Product was collected and provided in compliance with applicable data protection and related laws (including, where applicable, GDPR, EU AI Act and similar regimes).

  • The Licensor has used commercial best practices and reasonable technical measures to filter and exclude personal data/PII or special category sensitive data from the Data Product.

  • The Licensor is not aware of any malware, viruses, or malicious code in the Data Product.

  • The Data Product has been prepared with reasonable care and professional standards for AI training purposes.

  • To the best of Licensor’s knowledge, the Data Product has not been intentionally altered to include ‘poisoned’ samples, trigger phrases, or adversarial inputs designed to compromise the alignment or security of resulting Models.

The Licensor does not warrant that the Data Product is free from all third-party claims, nor that Licensee’s specific use of the Data Product, Models, or Derivative Works will be lawful in all jurisdictions or for all intended use cases. The Licensor does not guarantee specific model performance or outcomes.

3. Restrictions (Prohibited Uses)

This license covers internal research, evaluation, and educational use only.

3.1 Prohibited Activities

Licensee may not:

  • Use the Data Product, Models, or Derivative Works for any commercial purposes, monetisation, or deployment in external-facing AI services.

  • Resell, redistribute, or sublicense the Data Product itself.

  • Intentionally remove, obscure, or alter any digital watermarks, cryptographic signatures, or provenance metadata embedded within the Data Product. These markers are essential for verifying ethical sourcing and supply chain integrity.

  • Use the Data Product in ways that violate applicable laws or regulations, including discrimination, unlawful surveillance, or other prohibited applications.

3.2 Separate Licensing for Commercial Use

For uses involving commercial deployment, monetisation, or external distribution of Models, Licensee must obtain a Commercial Use License from the Licensor.

4. Models, Derivatives, and Redistribution Rights

4.1 Rights Granted

Licensee may create Models using the Data Product, including training, fine-tuning, or evaluating AI, ML, LLMs, and agentic AI systems. Licensee is granted a non-exclusive, worldwide, and perpetual right to:

  • Use Model outputs (predictions, text, audio, embeddings, images, or other results) for internal research or educational purposes only.

  • Create and use Derivative Works such as fine-tuned weights, embeddings, or model variants for internal purposes.

  • Evaluate Models and publish benchmarks or performance results, provided that the Data Product itself is not disclosed.

4.2 Restrictions on Data Product Redistribution

Licensee may not redistribute or sublicense the original Data Product itself.

4.3 Sharing of Models and Derivative Works

Licensee may share Models and Derivative Works only within the Licensee’s organization for research or educational purposes. External distribution is prohibited unless the underlying Data Product is not exposed and the distribution is strictly non-commercial (e.g., academic collaboration).

Safe Harbor for Model Outputs: The inadvertent, non-systematic reproduction of insignificant portions of the Data Product by a Model (e.g., through overfitting or stochastic generation) shall not constitute a breach of this Section, provided that the Licensee has not intentionally engineered the Model to act as a database or substitute for the Data Product.

4.4 Responsibilities and Liability

Licensee is responsible for its own use of Models and Derivative Works. While Licensee may release Derivative Works (such as Model Weights) for non-commercial research, Licensee acknowledges they are solely responsible for implementing ‘defence-in-depth’ safeguards and ensuring such releases comply with applicable Frontier AI Safety Frameworks.

5. Compliance Obligations

Licensee must comply with all applicable laws, including:

  • GDPR and other data protection regulations

  • EU AI Act transparency and documentation requirements

  • Export control regulations (see Section 5.2)

  • Copyright and intellectual property laws

  • Anti-discrimination and human rights laws

5.2 Export Control and Sanctions Compliance

Licensee represents that it is not subject to trade sanctions or export restrictions under applicable law. Licensee will not use the Data Product or Models in violation of export control laws including U.S. Export Administration Regulations (EAR), International Traffic in Arms Regulations (ITAR), or equivalent regulations.

5.3 Data Security and Protection

Licensee must implement appropriate technical and organizational measures to protect the Data Product from unauthorized access, disclosure, or misuse.

5.4 High-Risk AI Systems

Licensee may not use the Data Product for high-risk AI systems in production environments under this license.

5.5 EU AI Act Transparency Requirements

For general-purpose AI models covered by EU AI Act Article 53, Licensee acknowledges responsibility for publishing training data summaries where applicable.

6. Data Retention, Deletion, and Audit

6.1 Retention

Licensee may keep copies of the Data Product only as long as needed for internal training, evaluation, or research, and in line with applicable laws.

6.2 Deletion

Upon termination of this License (subject to Section 10.2) or written request from the Licensor, Licensee must promptly delete or irreversibly anonymize all copies of the Data Product, except for copies retained in standard backup systems that are scheduled for automatic deletion within a reasonable timeframe.

6.3 Models & Derivatives

Licensee may retain Models, Derivative Works, logs, and metadata indefinitely, provided the Data Product itself cannot be reconstructed from these materials.

6.4 Audit Rights

Where Licensor has reasonable suspicion of material breach of this License, Licensor may request reasonable evidence of compliance (e.g., deletion records, data retention policies, or security measures) no more than once per year, with sixty (60) days’ advance written notice.

7. Intellectual Property, Attribution, and Trademarks

7.1 Ownership of Data Product

The Data Product, including all content, metadata, and structure, is owned by the Licensor. Except for the rights granted under this License, no ownership or interest in the Data Product is transferred to the Licensee.

7.2 Ownership of Models and Derivative Works

Licensee owns Models and Derivative Works it lawfully creates using the Data Product, but this ownership does not include or extend to the Data Product itself.

7.3 Attribution

Neither party may publicly reference the other party or disclose the other party’s use of the Data Product or this licensing relationship without explicit written consent, except as required by law or regulation.

7.4 Trademarks and Branding

All Licensor trademarks, logos, and trade names remain the property of the Licensor. Licensee may not use them in ways that imply endorsement, sponsorship, or partnership without separate written permission.

8. Warranty Disclaimer and Liability Limits

8.1 Limited Data and IP Warranty

Licensor warrants that it has the right to license the Data Product under this License and that the Data Product was not obtained in violation of applicable law. Licensor does not warrant that the Data Product is free from all third-party claims, nor that Licensee’s specific use of the Data Product will be lawful in all jurisdictions.

8.2 No Other Warranties

Except for the limited warranties set out in Section 8.1, the Data Product is provided “as is” and “as available.” To the fullest extent permitted by law, Licensor disclaims all other warranties, whether express or implied.

8.3 No Guarantee of Results

Licensor does not guarantee any specific outcomes, performance, or results arising from the use of the Data Product.

8.4 Limitation of Liability

To the fullest extent permitted by applicable law, each party’s total aggregate liability to the other for all claims arising out of or relating to this License will not exceed the greater of: (a) the total amounts paid by Licensee for the Data Product under this License; or (b) any fixed liability cap specified in the Order or applicable commercial terms.

8.5 Exclusion of Certain Damages

To the fullest extent permitted by law, neither party will be liable to the other for any indirect, incidental, consequential, special, punitive, or exemplary damages.

9. Indemnification

9.1 Licensor Indemnification

Licensor agrees to indemnify, defend, and hold harmless Licensee from and against any third-party claims, damages, losses, or expenses (including reasonable attorneys’ fees) arising from Licensor’s breach of the warranties in Section 8.1, provided that:

  • Licensee promptly notifies Licensor in writing of the claim.

  • Licensor has sole control of the defense and settlement of the claim.

  • Licensee reasonably cooperates in the defense at Licensor’s expense.

9.2 Licensee Indemnification

Licensee agrees to indemnify, defend, and hold harmless Licensor from and against any third-party claims, damages, losses, or expenses (including reasonable attorneys’ fees) arising from:

  • Licensee’s use of the Data Product, Models, or Derivative Works in violation of this License (including unauthorized Commercial Use).

  • Licensee’s violation of applicable laws or regulations.

10. Term, Termination, and General Terms

10.1 Term & Termination

This License starts on the Effective Date and continues perpetually, subject to the following termination rights:

  • Either party may terminate for material breach if the breaching party fails to cure the breach within thirty (30) days of written notice.

  • Licensor may terminate immediately if required by applicable law or court order, or if Licensee’s use creates substantial legal, security, or compliance risks.

10.2 Effect of Termination

For termination due to Licensee’s material breach of payment obligations:

  • All rights to use the raw Data Product end immediately.

  • Licensee must delete the Data Product as per Section 6.2.

  • Licensee may retain and continue to use Models and Derivative Works created prior to termination, provided that Licensee cures the payment breach (including any applicable interest) within thirty (30) days of termination.

For all other termination scenarios:

  • All rights to use the Data Product end immediately.

  • Deletion/anonymization obligations in Section 6.2 apply.

  • Licensee may continue to use Models and Derivative Works created before termination under the terms of this License.

  • Sections 4.4, 7, 8, 9, and 10 survive termination.

10.3 Governing Law & Jurisdiction

This License is governed by the laws of England and Wales, without regard to conflict of law principles. Any disputes arising out of or relating to this License shall be resolved exclusively in the courts of London, England.

10.4 Notices

All notices under this License must be in writing and sent to the contact information specified in the Order. Notices may be delivered by email (with read receipt or confirmation), registered mail, or courier service.

10.5 Assignment

Licensee may not assign, transfer, or delegate its rights or obligations under this License without Licensor’s prior written consent, which shall not be unreasonably withheld.

10.6 Entire Agreement and Amendments

This License, together with the Order, constitutes the entire agreement between the parties regarding the Data Product and supersedes all prior agreements. This License may only be amended by a written document signed by both parties.

10.7 Severability

If any provision of this License is held to be invalid, illegal, or unenforceable, the remaining provisions shall continue in full force and effect.

10.8 Waiver

No failure or delay by either party in exercising any right under this License constitutes a waiver of that right.

10.9 Platform Third-Party Beneficiary

The parties acknowledge that this License is facilitated by Opendatabay. The parties agree that Opendatabay is a third-party beneficiary of this License solely for the purposes of enforcing the prohibition on off-platform transactions or circumvention of Platform fees (if applicable under Opendatabay’s Terms of Service).

END OF LICENSE

This license provides balanced protection for both data sellers and buyers, with clear definitions, fair termination provisions, symmetric confidentiality, proportional liability, and compliance with current AI regulations including EU AI Act requirements and export control considerations.

© [2026] Opendatabay. All Rights Reserved. This license is the proprietary intellectual property of Opendatabay. It is licensed for use solely for transactions facilitated by or authorized by Opendatabay. Unauthorized reproduction, modification, or distribution of this legal text for use on competing platforms or external transactions without Opendatabay’s express written consent is strictly prohibited and may result in legal action.

Last updated