private-ai comparison australia compliance

Private LLM vs Public LLM: What Australian Organisations Need to Know in 2026

Sasa Abe | | 15 min read

A private LLM runs on your infrastructure and keeps your data within your walls. A public LLM runs on someone else's servers and processes your data externally. For Australian organisations subject to the Privacy Act 1988, APRA prudential standards, or professional confidentiality obligations, this distinction determines whether AI adoption creates a compliance asset or a compliance liability.

Defining the Terms

The terminology around AI models has become muddled by marketing. Before comparing private and public LLMs, let us be precise about what each term means.

What Is a Public LLM?

A public LLM is a large language model operated by a third-party provider and accessed over the internet. You send your data to their servers, their infrastructure processes it, and the response is returned to you.

Examples of public LLMs:

  • ChatGPT (OpenAI) — hosted in the United States
  • Gemini (Google) — hosted across Google's global data centres
  • Microsoft Copilot — hosted on Microsoft Azure (primarily US and EU)
  • Claude (Anthropic) — hosted in the United States
  • Perplexity AI — hosted in the United States

When you or your staff type a query into any of these services, the text of that query — and any documents uploaded alongside it — leaves your environment and travels to servers operated by a US-headquartered company.

What Is a Private LLM?

A private LLM is a large language model that runs entirely on infrastructure you control. Your data never leaves your premises. No third party processes your queries. No external server stores your documents.

A private LLM typically uses open-source models such as:

  • Llama 3 (Meta) — high-performance general-purpose model
  • Gemma 4 (Google) — efficient model optimised for enterprise tasks
  • Mistral (Mistral AI) — strong reasoning and instruction-following
  • Hermes Agent — fine-tuned for agentic workflows and document tasks

These models are deployed using tools like Ollama (for model management) and ChromaDB or similar vector databases (for document retrieval), running on hardware in your office, server room, or private data centre.

The Key Distinction

Public LLM Private LLM
Where data is processed Provider's cloud servers (overseas) Your own infrastructure (on-premises)
Who has access Provider's employees, systems, and subprocessors Only your authorised staff
Who controls retention Provider (per their terms of service) You (per your own policies)
Internet required Yes, always No — runs fully offline
Model updates Provider decides when and how You decide if and when to update

The Complete Comparison

Data Handling and Privacy

Factor Public LLM Private LLM
Data leaves your premises Yes No
Third party processes your data Yes No
Data potentially used for model training Depends on tier and provider Never
Cross-border data transfer Almost always (to US/EU) Never
Data retention controlled by you No Yes
Full audit trail available Limited Complete
You can verify deletion No Yes
Compliant with APP 8 by default No (requires assessment) Yes (no disclosure occurs)
Compliant with APP 11 by default Requires provider assessment Under your direct control

For organisations handling client-privileged information, patient health records, or regulated financial data, the data handling differences are not academic. They determine whether you can demonstrate compliance to a regulator, respond to a client query about data handling, or survive a privacy audit without qualification.

Performance and Capability

Capability Public LLM Private LLM
General knowledge and reasoning Excellent Very good (model-dependent)
Creative writing and brainstorming Excellent Good
Enterprise document search (RAG) Limited (doesn't know your docs) Excellent (trained on your corpus)
Contract and document review Generic (no firm-specific context) Specific (knows your precedents)
Accuracy on your internal policies Poor (hallucinates confidently) High (retrieves from your actual documents)
Response latency Variable (depends on server load) Consistent (dedicated hardware)
Availability Depends on internet and provider uptime 100% uptime under your control
Customisation for your industry None (same model for everyone) Full (configured for your terminology and documents)

A common misconception is that private LLMs are significantly less capable than public ones. In 2024, this was partially true. In 2026, open-source models have closed the gap substantially for enterprise use cases. For the document-centric tasks that matter most to professional services firms — summarisation, search, analysis, and drafting — a well-configured private LLM with retrieval augmented generation (RAG) outperforms a public LLM that has no access to your documents.

The public LLM knows everything about the world but nothing about your firm. The private LLM knows everything about your firm and enough about the world to be useful.

Cost Comparison

Cost Element Public LLM (ChatGPT Enterprise) Private LLM (AIRGAP LLM)
Per-user licence AUD $45-$90/user/month $0
10 users, 12 months $5,400 - $10,800/year $0 licence cost
30 users, 12 months $16,200 - $32,400/year $0 licence cost
100 users, 12 months $54,000 - $108,000/year $0 licence cost
Hardware (one-time) $0 (cloud-hosted) $2,000 - $25,000
Deployment services (one-time) $0 (self-service) $10,000 - $50,000
Monthly support Included in licence $1,500 - $3,000/month
Cost in Year 1 (30 users) $16,200 - $32,400 $28,000 - $71,000
Cost in Year 2 (30 users) $16,200 - $32,400 (again) $18,000 - $36,000 (support only)
Cost in Year 3 (30 users) $16,200 - $32,400 (again) $18,000 - $36,000 (support only)
3-year total (30 users) $48,600 - $97,200 $64,000 - $143,000 (conservative)
Cost per additional user $45-$90/month (every user, forever) $0 (no marginal cost)

The cost crossover depends on team size and usage intensity. For organisations with 30+ users, the private LLM becomes cheaper within 18-24 months. For organisations with 100+ users, the private deployment is dramatically cheaper from year 2 onward because there are no per-seat fees.

More importantly: the cost comparison ignores the value of what you avoid. A data breach involving privileged client information or patient records carries costs — regulatory penalties up to AUD $50 million under the Privacy Act, professional indemnity claims, reputational damage, and client attrition — that dwarf the difference in deployment costs.

Compliance and Regulatory Alignment

Regulation Public LLM Impact Private LLM Impact
Privacy Act 1988 (APP 8) Cross-border disclosure to US/EU servers. You bear accountability for the overseas recipient's conduct. No cross-border disclosure. APP 8 does not apply.
Privacy Act 1988 (APP 11) Security depends on provider. Limited visibility into their controls, access, and subprocessors. Security under your direct control. Full visibility. Auditable.
Privacy Act 1988 (APP 6) Sending data to a third-party AI may constitute a use or disclosure beyond the original collection purpose. Data used within your organisation for the purpose it was collected. No third-party disclosure.
APRA CPS 234 Cloud AI expands your information security threat surface. Requires assessment of provider's security posture. Reduced threat surface. No external data transmission. Security controls are yours.
APRA CPS 230 Using cloud AI operationally may trigger material service provider obligations and third-party risk management requirements. No third-party dependency. No CPS 230 trigger.
My Health Records Act 2012 Section 77 penalties for unauthorised disclosure. Sending health data to external AI creates disclosure risk. Data remains within your controlled environment. No disclosure occurs.
Legal Profession Uniform Law Client privilege may be waived if information is disclosed to a third-party AI service. This risk cannot be mitigated by contract alone. Privilege maintained. No third-party disclosure. Same confidentiality boundaries as your existing document handling.
Evidence Act 1995 Privileged communications lose protection upon disclosure to third parties. AI query logs on external servers create discoverable records outside your control. All processing internal. No records on external systems. Privilege preserved.

For regulated Australian organisations, private LLM deployment does not merely reduce compliance risk — it eliminates entire categories of regulatory concern that public LLM usage creates.

What Public LLM Providers Say About Privacy (And What They Don't Say)

Public LLM providers are aware of privacy concerns and have responded with enterprise tiers, data processing agreements, and opt-out mechanisms. It is worth examining what these actually provide.

"We don't train on your data" (Enterprise tiers)

Most providers now offer enterprise plans that exclude your data from model training. This addresses one concern but not the fundamental issue: your data still travels to and is processed on their servers. The data is still subject to:

  • Their employee access controls (which you cannot verify)
  • Their subprocessors (which may change without notice)
  • Their legal jurisdiction (US law, including potential government access under CLOUD Act or FISA)
  • Their security posture (which you must take on faith)

Not training on your data is necessary but not sufficient for organisations with genuine confidentiality obligations.

"Your data stays in Australia" (Regional hosting)

Some providers offer Australian-region hosting. This helps with APP 8 (cross-border disclosure) but does not address:

  • Third-party processing (a US company still processes your data, even if the server is in Sydney)
  • The provider's employee access (staff in any country may access data for support or monitoring)
  • Vendor lock-in (the provider can change regions, terms, or pricing at will)
  • Auditability (you still cannot inspect exactly what happens to your data)

Regional hosting is better than overseas hosting, but it is not equivalent to on-premises control.

"We have SOC 2 and ISO 27001"

Certifications demonstrate that the provider has security processes. They do not:

  • Give you visibility into specific handling of your data
  • Prevent authorised employees from accessing your information
  • Override the legal jurisdiction in which the provider operates
  • Protect you if the provider experiences a breach

Certifications are about the provider's general posture, not about your specific data.

The Models Behind Private LLMs in 2026

A common concern is that private LLMs use "lesser" models compared to the cutting-edge public offerings. This deserves a factual response.

The Current Landscape of Open-Source Models

Model Parameters Strength VRAM Required Suitable For
Llama 3 (70B) 70 billion Strong general reasoning, instruction following 48GB Large enterprise deployments
Llama 3 (8B) 8 billion Fast, efficient, good for most tasks 8GB Small-medium teams
Gemma 4 Various Efficient, strong at structured tasks 8-24GB Document analysis, summarisation
Mistral Large 123 billion Excellent reasoning and multilingual 80GB+ Complex analysis, multi-language
Mistral (7B) 7 billion Fast, capable, low resource requirement 8GB Quick queries, high-volume use
Hermes Agent Various Agentic tasks, tool use, document workflows 8-48GB Automated document processing

These models are free to download, free to run, and free to deploy commercially. There are no licence fees, no per-query costs, and no terms of service restricting how you use them with your own data.

How RAG Bridges the Knowledge Gap

Public LLMs have vast general knowledge but know nothing about your documents. Private LLMs know less about the world but, when combined with Retrieval Augmented Generation (RAG), know everything about your documents.

RAG works by:

  1. Indexing your internal documents into a vector database (such as ChromaDB)
  2. When a user asks a question, finding the most relevant document passages
  3. Feeding those passages to the LLM as context
  4. Generating a response grounded in your actual documents, with citations

The result: a private LLM with RAG gives answers sourced from your contracts, policies, precedents, and records — with references to the specific documents. A public LLM gives general answers based on its training data, which may be outdated, incorrect, or irrelevant to your specific context.

For a law firm reviewing contracts, a healthcare provider searching clinical guidelines, or a financial services firm checking compliance policies, the RAG-powered private LLM is more useful than a public LLM precisely because it knows your documents.

Who Should Choose Private LLM

Private LLM deployment is the right choice for organisations that meet any of the following criteria:

  • You handle legally privileged information. Law firms, barristers, in-house legal teams. Privilege may be waived upon disclosure to a third party. Private deployment preserves privilege absolutely.

  • You process patient health data. Hospitals, clinics, allied health providers, aged care. The My Health Records Act 2012 imposes criminal penalties for unauthorised disclosure. Private deployment eliminates third-party disclosure.

  • You are APRA-regulated. Banks, insurers, superannuation funds, wealth managers. CPS 234 requires you to manage information security risk, and CPS 230 requires you to manage third-party operational risk. Private deployment reduces both.

  • You serve government clients. Defence industry, contractors, consultancies with government engagements. Sovereign data requirements may prohibit external AI processing. Private deployment keeps data within Australian-controlled infrastructure.

  • You have 20+ staff who would benefit from AI. At this scale, private deployment is cost-competitive with public LLM subscriptions within 18-24 months while providing superior privacy, compliance, and customisation.

  • Your staff are already using AI informally. If people are pasting sensitive data into ChatGPT without approval, you have an immediate governance problem. Private deployment gives them a sanctioned alternative that is more useful (because it knows your documents) and safer (because data stays internal).

Who Should Choose Public LLM

Public LLM services remain the better choice in some scenarios:

  • Your organisation does not handle sensitive or regulated data
  • You need AI primarily for marketing content, social media, or public-facing communications
  • You have fewer than 10 users and no internal document corpus
  • Speed of deployment is the overriding priority (public LLMs are available immediately)
  • Your use cases are general-purpose rather than document-specific

There is no shame in choosing a public LLM when your risk profile supports it. The issue arises when organisations with genuine compliance obligations default to public LLMs because they seem easier, without assessing the regulatory implications.

The Hybrid Approach

Some organisations adopt a hybrid strategy: public LLM for non-sensitive tasks, private LLM for anything involving regulated or privileged data.

This can work, but it requires:

  • Clear policies defining what constitutes "sensitive" data
  • Staff training on which tool to use for which task
  • Technical controls preventing sensitive data from reaching the public LLM
  • Regular auditing to verify compliance with the boundary

In practice, AIRGAP LLM finds that most regulated organisations are better served by deploying private AI for all internal tasks. The cost difference is negligible once you have the infrastructure, and the governance overhead of maintaining a split approach often exceeds the value.

Making the Decision

If your organisation handles sensitive data in a regulated industry, the comparison between private and public LLMs is not primarily about capability or cost. It is about control.

A public LLM offers convenience in exchange for surrendering control over where your data goes, who processes it, and what happens to it afterwards.

A private LLM requires more upfront investment in exchange for retaining complete custody of your data, full compliance demonstrability, and a system configured specifically for your documents and workflows.

For Melbourne-based organisations evaluating this decision, AIRGAP LLM offers a confidential initial consultation to assess your specific situation — your data sensitivity profile, compliance obligations, team size, and use cases. No obligation, no sales pressure, just a clear-eyed assessment of which approach suits your organisation.

Book a consultation or call us to discuss your requirements.

Frequently Asked Questions

What is the difference between a private LLM and a public LLM?

A private LLM runs on infrastructure you control — your own server, on your premises, with no external data transmission. A public LLM (such as ChatGPT, Gemini, or Copilot) runs on the provider's cloud servers, meaning your data leaves your environment and is processed by a third party. The core difference is data custody: private means you retain full control, public means you delegate control to the provider.

Is a private LLM as capable as ChatGPT or GPT-4?

For enterprise document tasks — summarisation, search, drafting, contract review, and Q&A against internal knowledge — modern open-source models like Llama 3, Gemma 4, and Mistral perform comparably to public LLMs. Public LLMs have broader general knowledge and stronger creative writing, but for business workflows involving your own documents, private models deliver equivalent or superior results because they are configured specifically for your data.

Do Australian privacy laws require private LLM deployment?

The Privacy Act 1988 does not explicitly mandate private deployment. However, APP 8 (cross-border disclosure) and APP 11 (security of personal information) create significant compliance obligations when sending sensitive data to overseas cloud AI providers. For organisations handling privileged, health, or financial data, private deployment is often the most practical path to compliance — not because the law demands it specifically, but because it eliminates the hardest compliance challenges.

Can I run a private LLM without a dedicated IT team?

Yes. A deployment partner like AIRGAP LLM handles the initial setup, configuration, and ongoing maintenance. Your organisation needs no AI expertise or dedicated infrastructure team. The system runs on a single server (or a Mac Mini for smaller teams) and requires minimal day-to-day management once deployed.

How much does a private LLM cost compared to ChatGPT Enterprise?

ChatGPT Enterprise costs AUD $45-$90 per user per month — for 30 users, that is $16,200-$32,400 per year in perpetuity. A private LLM deployment costs $15,000-$35,000 upfront (hardware plus setup) and $1,500-$2,500 per month in support, with zero per-user fees. By month 18-24, the private deployment is cheaper, and the cost advantage grows with every additional user.

SA

Sasa Abe

Co-Founder, AIRGAP LLM

Software engineer specialising in privacy-focused AI architecture, RAG systems, and local LLM deployment for data-sensitive organisations.

About our team →

Want to See How This Works for Your Firm?

We'll walk you through a deployment that fits your setup — your documents, your infrastructure, your compliance requirements. No sales pitch.

Request a Consultation

Or email us directly at hello@airgapllm.com.au