Back to version

Published version: AIFC-V002. This is the latest published version. All versions.

AIFC-064: Data Classification

Status: Draft 0.1 Standard: AI-First Community Standard Short name: AIFC Builds on:

Purpose of this document: Define Data Classification as a basic security and governance mechanism for the AIFC knowledge base, Source of Truth, Operational DNA, metadata, AI inputs, AI outputs, derived knowledge, audit logs, interfaces, and cross-community sharing. Data Classification makes it possible to govern access, AI processing, sharing, export, audit, retention, and protection of community know-how according to sensitivity and impact.


1. Purpose of this document

This document defines Data Classification.

An AIFC community works with knowledge that may be public, internal, sensitive, or critical.

This knowledge is not only data.

It may contain:

Without classification, the community cannot responsibly decide:

Data Classification is a foundation for safe operation of an AI-first community.


2. Core principle

The core principle of this document is:

Classify knowledge by sensitivity, purpose and impact on community capability.

AIFC states:

Do not classify only files.
Classify capability exposure.

In an AI-first community, the greatest risk may be that a well-structured summary reveals more than the individual documents.


3. Definition

Data Classification is the governed mechanism by which a community labels data, knowledge artefacts, metadata, AI inputs, AI outputs, derived knowledge, audit logs, interfaces, and other information assets according to sensitivity, purpose, risk, and impact.

Data Classification determines:

Minimum requirement

Every meaningful knowledge artefact must have a classification or inherit it from location, type, workflow, owner, or rule.


4. Why Data Classification matters

Without classification, the community does not know what it protects.

That creates two extremes.

First extreme:

Everything is open.

Result:

Second extreme:

Everything is restricted.

Result:

Good classification makes it possible to protect what is sensitive while sharing what is safe.

Minimum requirement

Data Classification must support both protection and usability of the knowledge base.


5. Classification is not only confidentiality

Classification is not only about confidentiality.

AIFC classification should consider:

For example, a public document may have low confidentiality but high reputational impact.

Internal metadata may contain little text but strongly reveal strategy.

Minimum requirement

Classification must account for impact, not only secrecy.


AIFC recommends these base classification layers:

Public
Internal
Restricted
Operational DNA

A community may extend this model according to legal, sector, or organizational needs.

Minimum requirement

The community must have clearly defined classification levels and meanings.


7. Public

Public information is intended for public sharing.

Examples:

Public does not mean ownerless.

Public artefacts must still be correct, current, approved, and reviewed.

Risks:

Minimum requirement

Public artefacts derived from the internal knowledge base must have an owner, status, and public release review.


8. Internal

Internal information is intended for the community or organization.

Examples:

Internal does not mean it can be placed into any AI.

Internal does not mean it can be sent to a vendor.

Internal does not mean it cannot contain sensitive metadata.

Minimum requirement

Internal artefacts must have rules for external sharing and AI processing.


9. Restricted

Restricted information is sensitive and has limited access.

Examples:

Restricted artefacts require:

Minimum requirement

Restricted artefacts must have owner, access control, AI processing rule, export rule, and audit appropriate to risk.


10. Operational DNA

Operational DNA is critical know-how that describes or enables community capability.

Examples:

Operational DNA is the most sensitive classification.

Leakage may mean loss of capability, competitive advantage, security, autonomy, or trust.

Minimum requirement

Operational DNA must have the highest protection, limited access, explicit AI-NDA Boundary, audit, export control, and owner.


11. Classification by content

Classification may be based on the artefact’s content.

Examples:

Minimum requirement

Classification must consider the actual artefact content, not only its name or location.


12. Classification by context

The same content may have a different classification depending on context.

Example:

A general description of a workflow may be Internal.

The same workflow combined with customer patterns, decision logic and automation rules may become Operational DNA.

Context may include:

Minimum requirement

Classification must consider the context of use and the combination of information.


13. Classification by aggregation

Aggregation may increase sensitivity.

Individual documents may be Internal.

Their synthesis may be Restricted or Operational DNA.

Example:

Minimum requirement

Aggregated or synthesized knowledge must be classified by the sensitivity of what it reveals, not only by input classifications.


14. Derived knowledge classification

AI often creates derived knowledge.

Derived knowledge is new knowledge created from existing inputs.

It may be:

Derived knowledge may be more sensitive than its inputs.

Minimum requirement

AI-generated derived knowledge must be classified by impact and by what it reveals.


15. Metadata classification

Metadata may be sensitive.

Examples:

Metadata may reveal community structure, priorities, or weaknesses.

Minimum requirement

Metadata must be classified or protected according to what it reveals.


16. Prompt and output classification

Prompts and AI outputs must be classified.

A prompt may contain sensitive data.

An output may contain:

Minimum requirement

Critical AI prompts and outputs must have classification or an audit policy that handles their sensitivity.


17. Audit log classification

Audit logs may be highly sensitive.

They may reveal:

Minimum requirement

Audit logs must have their own classification and access control.


18. Interface classification

Community Interface, Enterprise Interface, and public interface must be classified.

An interface may be public, internal, or restricted.

Risk appears when an interface reveals:

Minimum requirement

Interfaces must be reviewed against Operational DNA exposure and classified accordingly.


19. Skill classification

Human skills and AI skills may have different sensitivity.

Public skill:

Internal skill:

Restricted skill:

Operational DNA skill:

Minimum requirement

Skills must be classified by the capability they reveal.


20. Agent permission classification

Agent permissions may themselves be sensitive.

They may reveal:

Minimum requirement

Agent permissions must have classification and access control appropriate to risk.


21. Decision Record classification

Decision Records may be public, internal, restricted, or Operational DNA.

It depends on what the decision reveals.

A Decision Record may contain:

Minimum requirement

Decision Records must have classification according to content, impact, and audience.


22. Classification and AI-NDA Boundary

The AI-NDA Boundary must be derived from classification.

Example:

Public:
AI processing allowed by default unless restricted by policy.

Internal:
AI processing allowed only with approved tools or rules.

Restricted:
AI processing requires AI-NDA Boundary and purpose limitation.

Operational DNA:
AI processing requires explicit approval, strict boundary, audit and usually private or controlled environment.

Minimum requirement

Every classification level must have an AI processing rule.


23. Classification and access control

Access Control is based on classification.

Classification determines:

Minimum requirement

Access rules must be mapped to classification levels.


24. Classification and export

Export is especially risky for Restricted and Operational DNA.

Export rules must define:

Minimum requirement

Restricted and Operational DNA export must require explicit approval and audit.


25. Classification and public release

Public release is a classification change outward.

The community must check:

Minimum requirement

Moving internal or sensitive know-how into a public output must have public release review.


26. Classification and retention

Classification affects retention.

Public content may be retained long-term.

Internal content may have a review cycle.

Restricted content may have limited retention.

Operational DNA may require long-term protection, regular review, and access control.

Audit logs may have separate retention rules.

Minimum requirement

Classification levels must have retention or review rules.


27. Classification and deletion

Deletion must respect classification.

Some content must be deleted because of law or boundary.

Some content must not be deleted because of audit.

Some content should be archived.

Some content must be removed from agent memory, embeddings, or cache.

Minimum requirement

Sensitive classification levels must have deletion, archive, or retention rules.


28. Classification and embeddings

Embeddings and vector stores must inherit classification from sources.

If a source changes, is deleted, or becomes restricted, the community must address:

Minimum requirement

Embeddings from Restricted or Operational DNA content must be protected at the same or higher level as the source.


29. Classification and aggregation in Human Cockpit Layer

The Human Cockpit Layer may show aggregations.

Aggregation may reveal sensitive information even when individual items are not shown.

Examples:

Minimum requirement

The Human Cockpit Layer must classify aggregated views by what they reveal.


30. Classification and cross-community sharing

When sharing between communities, it must be clear:

Minimum requirement

Cross-community sharing of non-public knowledge requires classification mapping or explicit sharing boundary.


31. Classification inheritance

Classification may be inherited.

Examples:

Inheritance reduces friction.

But it must not be blind.

Content may require higher classification than the default.

Minimum requirement

Classification inheritance must allow classification to increase according to content and impact.


32. Classification override

Sometimes classification must change.

Override must be governed.

Examples:

Minimum requirement

Lowering the classification of Restricted or Operational DNA must require approval.


33. Classification review

Classification may become stale.

Review asks:

Minimum requirement

Restricted and Operational DNA artefacts must have a classification review cycle or trigger.


34. AI-assisted classification

AI may help with classification.

It may:

However, AI must not lower the classification of a critical artefact without approval.

Minimum requirement

AI-assisted classification must be marked as proposal until approved for Restricted or Operational DNA artefacts.


35. Misclassification

Misclassification is incorrect sensitivity labeling.

Examples:

Misclassification is a knowledge security risk.

Minimum requirement

Misclassification incidents must be recorded and handled according to impact.


36. Classification and values

Classification is not only a technical rule.

It reflects community values.

Examples:

Too low a classification may violate trust.

Too high a classification may harm learning and cooperation.

Minimum requirement

Classification policy must balance security, transparency, learning, and responsibility.


37. Classification and Human Capability Reserve

If critical know-how is too closed, the community may lose recovery capability.

The community therefore needs:

Classification should protect learning, not destroy it.

Minimum requirement

Operational DNA protection must be balanced with Human Capability Reserve through safe human-readable variants or training paths.


38. Classification and AI lock-in

If sensitive know-how exists only in an AI vendor platform, agent memory, or proprietary skill store, classification and control are weakened.

AIFC prefers classification of critical artefacts to be held in the Source of Truth or governance repository.

Minimum requirement

Critical classified artefacts must not be authoritatively classified only in an AI vendor system.


39. Classification policy

An AIFC community should have a classification policy.

It defines:

Minimum requirement

A community working with non-public know-how must have a classification policy or equivalent.


40. Suggested metadata

Example metadata for artefact classification:

classification:
  level: public | internal | restricted | operational_dna
  owner:
  reason:
  inherited_from:
  contains_personal_data: true | false
  contains_secrets: true | false
  contains_customer_data: true | false
  contains_operational_dna: true | false
  ai_processing:
    allowed: true | false
    rule: public_allowed | approved_tools_only | redaction_required | private_environment_only | explicit_approval_required | forbidden
    ai_nda_boundary:
  export:
    allowed: true | false
    approval_required: true | false
    redaction_required: true | false
  sharing:
    internal_allowed: true | false
    external_allowed: true | false
    cross_community_boundary_required: true | false
  audit_required: true | false
  retention_rule:
  review_cycle:
  last_reviewed:

Example metadata for classification review:

classification_review:
  id:
  title:
  status: scheduled | in_progress | approved | changed | escalated | closed
  artefact:
  current_classification:
  proposed_classification:
  reason:
  ai_assisted: true | false
  reviewer:
  approval_required: true | false
  decision:
  decision_record:
  created_at:
  closed_at:

Example metadata for a misclassification incident:

misclassification_incident:
  id:
  title:
  status: observed | triaged | contained | corrected | under_review | closed
  artefact:
  original_classification:
  corrected_classification:
  incident_type:
    - under_classified
    - over_classified
    - missing_classification
    - public_leak
    - ai_processing_violation
    - export_violation
    - metadata_leak
    - derived_knowledge_misclassified
  affected_communities:
  ai_involved: true | false
  impact:
  corrective_actions:
  related_change_proposal:
  owner:
  created_at:
  closed_at:

These structures are illustrative.

The final schema should be defined in the agent-actionable layer of the standard.


41. Anti-patterns

AIFC rejects the following anti-patterns.

41.1 No classification

Knowledge artefacts have no classification and access is governed by chance or convenience.

41.2 Classification by folder only

Folder location determines classification, but content is not reviewed.

41.3 Everything internal

The community marks everything as internal and ignores Restricted and Operational DNA.

41.4 Everything restricted

The community marks everything as restricted and kills learning, onboarding, and cooperation.

41.5 AI output unclassified

AI outputs are used without classification even when they contain derived knowledge.

41.6 Metadata ignored

Metadata is not protected even though it reveals sensitive information.

41.7 Aggregation ignored

Aggregation or synthesis increases sensitivity, but classification does not change.

41.8 Public release without review

Internal content is published without checking Operational DNA exposure.

41.9 AI declassification

AI lowers classification without human approval.

41.10 Vendor classification mismatch

The community shares data with a vendor without verifying that the vendor recognizes the same classification and boundary.

41.11 Embeddings without classification

Embeddings from Restricted content are stored as ordinary technical artefacts.

41.12 Classification as bureaucracy

Classification is treated as a compliance checkbox, not as protection of community capability.


42. Minimal requirements

An AIFC community must at minimum meet these Data Classification requirements:

  1. Meaningful knowledge artefacts have classification or inherit it from a rule.
  2. Data Classification supports both protection and usability of the knowledge base.
  3. Classification accounts for impact, not only secrecy.
  4. The community has clearly defined classification levels.
  5. Public artefacts derived from the internal knowledge base have owner, status, and public release review.
  6. Internal artefacts have rules for external sharing and AI processing.
  7. Restricted artefacts have owner, access control, AI processing rule, export rule, and audit.
  8. Operational DNA has highest protection, limited access, AI-NDA Boundary, audit, export control, and owner.
  9. Classification considers the actual artefact content.
  10. Classification considers context of use and combinations of information.
  11. Aggregated or synthesized knowledge is classified by what it reveals.
  12. AI-generated derived knowledge is classified by impact.
  13. Metadata is classified or protected according to what it reveals.
  14. Critical AI prompts and outputs have classification or audit policy.
  15. Audit logs have their own classification and access control.
  16. Interfaces are reviewed against Operational DNA exposure.
  17. Skills are classified by the capability they reveal.
  18. Agent permissions have classification and access control.
  19. Decision Records have classification according to content, impact, and audience.
  20. Every classification level has an AI processing rule.
  21. Access rules are mapped to classification levels.
  22. Restricted and Operational DNA export requires approval and audit.
  23. Moving internal or sensitive know-how into a public output has public release review.
  24. Classification levels have retention or review rules.
  25. Sensitive classification levels have deletion, archive, or retention rules.
  26. Embeddings from Restricted or Operational DNA content are protected like the source.
  27. The Human Cockpit Layer classifies aggregated views by what they reveal.
  28. Cross-community sharing of non-public knowledge requires classification mapping or explicit sharing boundary.
  29. Classification inheritance allows classification to increase by content and impact.
  30. Lowering classification of Restricted or Operational DNA requires approval.
  31. Restricted and Operational DNA artefacts have classification review cycle or trigger.
  32. AI-assisted classification is marked as proposal until approved for Restricted or Operational DNA.
  33. Misclassification incidents are recorded and handled according to impact.
  34. Classification policy balances security, transparency, learning, and responsibility.
  35. Operational DNA protection is balanced with Human Capability Reserve through safe variants or training paths.
  36. Critical classified artefacts are not authoritatively classified only in an AI vendor system.
  37. A community working with non-public know-how has classification policy or equivalent.

43. Summary

Data Classification is a foundation of a safe AI-first knowledge base.

Without classification, it is not possible to responsibly govern:

AIFC therefore states:

Classify what knowledge reveals.
Classify what AI derives.
Classify what aggregation exposes.
Classify what capability depends on.

Correct classification allows the community to share safely, protect precisely, and use AI without losing control.

Data Classification turns knowledge sensitivity into governed protection and usable trust.