The Authenticity Crisis: AI Is Rewriting Assessment Integrity in Australian VET

June 3, 2026 Sukh Sandhu

Assessment integrity is no longer a background compliance issue. In the age of artificial intelligence, it is becoming one of the defining quality tests for the entire VET sector.

For years, assessment integrity in vocational education and training was treated as something stable. It mattered, of course, but it was largely understood through familiar risks. Plagiarism. Collusion. Contract cheating. Poor assessment design. Weak validation. Inconsistent judgement. These were serious problems, but they sat inside a recognisable system. RTOs knew the language, regulators knew the warning signs, and the sector broadly understood what had to be protected.

Artificial intelligence has changed that.

The rise of generative AI has done more than introduce a new technology into education. It has unsettled some of the sector’s most basic assumptions about evidence, authenticity, authorship and competence. Tasks that once appeared to reveal student understanding can now be completed, refined or transformed with astonishing ease. Responses that look polished, coherent and technically accurate may no longer tell us much about what the learner actually knows or can do. The confidence many providers once placed in written evidence has been shaken, and rightly so.

This is not just a digital disruption story. It is a quality assurance story. It is a governance story. It is a standard story. In the Australian VET context, where qualifications are expected to reflect real capability and workplace readiness, the implications are profound. If assessment can no longer be trusted to show that the student, not a tool, has demonstrated the required knowledge and skill, then the credibility of the outcome itself starts to weaken.

That is why the conversation about AI and assessment integrity can no longer sit at the edges of sector debate. It has moved to the centre. And it is forcing RTOs to confront a difficult but unavoidable question. How do you prove competence in a world where machines can produce convincing evidence on demand?

The problem is bigger than cheating

It is tempting to frame the issue narrowly, as though the challenge begins and ends with student misconduct. That is too simplistic, and it lets the sector off too lightly. The real disruption created by AI is not only that some learners may misuse it. The deeper problem is that AI has exposed weaknesses that were already present in many assessment systems.

Where assessments are generic, over-reliant on written responses, poorly contextualised, lightly supervised or weakly authenticated, AI does not merely introduce a new risk. It amplifies old ones. It reveals how fragile some existing models already were. A task that can be completed by a chatbot in seconds may not have been a strong test of competence in the first place. An assessment system that cannot distinguish between genuine learner understanding and polished machine-assisted output may have been too dependent on surface-level products all along.

This is why the sector must resist the comforting idea that AI is simply an external threat arriving from outside. In many cases, AI is functioning like a stress test. It is showing RTOs where their assessment practice was already too thin, too trusting or too mechanically designed.

That should concern providers, but it should also focus them. The challenge is not only to stop misuse. It is to build assessment systems strong enough to retain meaning in an AI-enabled environment.

Authenticity has become the hardest question in the room

For a long time, authenticity was discussed in fairly straightforward terms. Did the student complete the work themselves? Is the evidence their own? Can the assessor be satisfied that the performance belongs to the learner? Those questions still matter, but they have become much harder to answer with confidence.

Generative AI tools can now draft reports, summarise readings, answer short questions, create reflective-sounding responses, restructure weak writing into polished prose and produce plausible project material in seconds. The result is that the old visual cues many assessors relied upon are losing their value. A fluent submission is no longer strong evidence of understanding. A neatly written answer is no longer reliable proof of capability. Even work that appears personalised may have been heavily shaped by machine assistance.

That does not mean every learner using AI is cheating. It does mean that authenticity can no longer be assumed. It has to be designed, tested and verified much more deliberately.

This is where many providers are still behind the moment. Too much assessment practice in the sector continues to behave as though the pre-generative-AI world remains intact. Tasks are issued in the same way. Expectations are left vague. Rules around acceptable use are underdeveloped. Assessors are expected to make judgment calls without enough guidance. Then, when concerns arise, the system scrambles to respond after the fact.

That is not sustainable. If authenticity is going to remain central to assessment integrity, then RTOs need to stop treating it as a passive assumption and start treating it as an active design principle.

The old dependence on written evidence is now a major vulnerability

One of the clearest consequences of AI is the pressure it places on text-heavy assessment. Across the VET sector, written assignments, short-answer tasks, case study responses and report-style submissions have long been used to test knowledge and understanding. In the right context, they still have value. But their weaknesses are now much more visible.

Where tasks are predictable, generic or detached from a learner’s real context, AI can often perform well enough to compromise the educational value of the exercise. Even if the student has engaged in some real learning, the assessment outcome may no longer show what proportion of the work reflects the student’s actual reasoning, judgement or synthesis.

The danger here is not merely that AI can help students produce better text. The greater danger is that providers may continue treating text production itself as the key evidence of competence, even where the real capability to be assessed lies elsewhere. In such cases, the assessment begins to reward output rather than understanding.

That is why the sector needs a more honest conversation about what written evidence can and cannot prove in the current environment. Some written tasks will remain appropriate. Others will need to be redesigned. Some will need stronger supervision or oral verification. Others will need to be replaced altogether by methods that provide richer, more defensible evidence of skill and knowledge.

The issue is not whether written assessment has a place. The issue is whether the sector still understands its limits.

Assessment design now matters more than ever

In the AI era, assessment integrity cannot be rescued at the detection stage alone. It has to begin at the design stage.

That means asking tougher questions before a task is ever issued. What is this assessment actually intended to prove? Could a tool complete the visible product without the learner demonstrating the underlying skill? Does the task require reasoning, application, decision-making and contextual understanding, or does it mainly reward the ability to assemble polished language? Can authenticity be checked later if concerns arise? Is the method fit for an AI-enabled environment, not just a pre-AI one?

These questions are not optional refinements. They are becoming central to quality practice.

One of the strongest responses available to RTOs is to design assessments that privilege process as much as product. When learners must show how they arrived at an answer, why they made certain decisions, what evidence they relied upon, what changes they made after feedback, and how they would apply their learning in specific contexts, the assessor gains a much clearer view of actual understanding.

This does not make misuse impossible, but it makes superficial substitution much harder. It also shifts assessment closer to the kind of judgment-rich, context-sensitive thinking that matters in real workplaces.

The more assessment tasks ask students only to produce tidy text, the more exposed the system becomes. The more they require learners to demonstrate thinking, application, explanation and adaptation, the stronger the integrity of the evidence becomes.

Reflection, staging and dialogue are moving back to the centre

One of the most effective responses to AI pressure is to bring the learner’s process back into view.

Reflective components can be particularly valuable when they are designed properly. When learners are asked to explain their reasoning, justify their approach, discuss challenges they encountered, evaluate the quality of their own decisions or connect learning to specific experiences, the assessor gains access to something more revealing than a polished final answer. Reflection is not immune to AI misuse, but it becomes far more useful when it is anchored in the student’s own context, linked to real performance, and explored further through conversation.

Staged assessment can also strengthen integrity. Rather than relying on one finished submission, providers can assess development over time. Learners might submit a plan, then a draft, then a revised version, then discuss the rationale behind the final product. This creates a more visible trail of learning and makes it easier to identify whether the student is genuinely engaging with the task or simply presenting a final polished artefact disconnected from any observable developmental process.

Dialogue matters too. Structured oral questioning, short viva-style follow-up conversations, recorded explanation tasks and other forms of verbal verification are gaining renewed importance because they allow assessors to probe understanding directly. They do not need to replace all other forms of assessment, but they can play a critical role in confirming whether the learner can genuinely explain, defend and apply what has been submitted.

In an era of machine-generated fluency, human dialogue becomes a powerful integrity tool.

Practical assessment has become even more valuable

In many areas of VET, the strongest assessment evidence has always come from performance in real or simulated workplace contexts. Practical demonstrations, live interactions, applied tasks, observation and decision-making under authentic conditions remain among the most defensible ways to assess competence.

AI has only reinforced that reality.

Where a learner must physically perform a task, interact with others, solve problems in real time, respond to conditions as they emerge, or demonstrate applied skill in a realistic environment, the space for superficial AI substitution narrows considerably. That does not remove all integrity risks, but it changes the nature of the evidence in ways that are much more resistant to manipulation.

This is one of the reasons the current moment may ultimately produce something positive. It may push the sector back towards richer, more authentic, more practice-based assessment methods. Not because technology is bad, but because the pressure created by AI is forcing providers to ask whether their assessment methods were truly aligned with workplace capability in the first place.

Of course, practical assessment is not cost-free. It demands time, supervision, resources and access to suitable environments. For some RTOs, especially those under operational strain, expanding these approaches will be challenging. But that challenge cannot be used as an excuse for clinging to weak methods simply because they are easier to administer. Convenience is not the same thing as quality.

Detection tools will not solve this problem

There is understandable interest in AI detection tools, but the sector should be very cautious about treating them as a solution. Their reliability is inconsistent, their accuracy is contested, and the risk of false positives is too significant for providers to place heavy confidence in them.

More importantly, overreliance on detection can distract from the more important question of system design. A provider that depends on software to tell it whether integrity has been compromised is already operating too late in the process. By that point, the assessment method may already have failed to generate sufficiently robust evidence.

Detection tools may have a limited supplementary role in some contexts, but they are not a substitute for better task design, clearer student guidance, stronger verification and more defensible professional judgement. The sector needs to resist the temptation to outsource its integrity problem to software.

Assessment integrity will not be preserved by forensic suspicion alone. It will be preserved by building systems that remain credible even when technology becomes more sophisticated.

The sector needs clear rules on acceptable use

One of the most dangerous features of the current moment is ambiguity. Many learners are using AI without clear guidance. Many staff are using it too. In some organisations, the technology is everywhere, but the rules are nowhere.

That creates confusion, inconsistency and risk.

Students need to know when AI use is permitted, when it is restricted, when it is prohibited, and what disclosure is expected. They need to understand the difference between support and substitution. They need to know whether using AI for planning, brainstorming, grammar support, summarising, drafting, or idea generation is acceptable, and if so, under what conditions. They also need to know when the use of AI undermines the purpose of the task and crosses into misconduct.

Staff need the same clarity. An assessor cannot fairly judge misuse if the organisation has never defined acceptable use properly. A trainer cannot teach students responsibly if expectations differ from one unit to the next. A compliance manager cannot defend the organisation’s practice if informal habits have replaced formal policy.

The issue is not simply whether an RTO allows AI. The issue is whether its position is deliberate, intelligible and aligned with the competencies being assessed.

RTOs must confront the provider side of the integrity equation

A great deal of sector discussion still focuses on what students might do wrong. That is only half the story.

Providers themselves can weaken assessment integrity through careless adoption of AI. This may happen when assessment tools are generated quickly and published without sufficient review, when feedback is automated to save time but loses specificity and meaning, when staff rely on machine-produced content they do not properly evaluate, or when workflow efficiencies begin replacing academic judgement.

This is where AI becomes a leadership and governance issue.

Senior management teams need to know where AI is being used across the organisation, who approved that use, what controls are in place, what quality assurance applies, and where human judgement remains mandatory. They need to know whether staff are being trained or simply improvising. They need to know whether assessment systems have been redesigned for the AI era or merely patched around the edges.

The greatest long-term risk may not be overt cheating by learners. It may be the quiet normalisation of weak provider practices that hollow out assessment quality while appearing modern and efficient on the surface.

That kind of decline is much harder to spot quickly. But once it sets in, it can damage confidence in outcomes across the organisation.

Equity cannot be ignored

AI also creates a serious equity challenge. Access to tools, digital confidence and familiarity with prompt-based systems are not evenly distributed. Some learners are already more confident, better resourced and more digitally fluent than others. If providers build assessment systems that assume equal access to AI capability, they may unintentionally deepen disadvantage.

This does not mean the answer is to shut technology out. It means equity has to be considered deliberately. Learners need guidance, not just access. They need support in understanding the capabilities and limits of AI tools. They need digital literacy that includes critical judgment, not only functional use. And providers need to think carefully about whether AI-rich assessment conditions advantage some groups unfairly while leaving others behind.

Inclusion in the AI era is not just about allowing tools. It is about ensuring that technology use does not quietly distort fairness.

Leadership will determine whether the sector responds well or badly

No RTO can drift through this moment successfully. The organisations that respond best will be those that treat AI and assessment integrity as a strategic issue, not merely an operational nuisance.

That means investing in professional development. It means reviewing policies and procedures. It means examining current assessment tools honestly. It means involving academic, compliance, delivery and executive teams in the same conversation. It means asking what evidence of competence should look like now, not what it looked like five years ago.

It also means accepting that some long-standing practices may need to change. That can be uncomfortable. Staff may resist. Systems may need redesign. Resource implications may be real. But avoiding the issue is not a strategy. It is delayed.

The credibility of VET qualifications depends on the sector’s willingness to confront this challenge directly.

Conclusion

Artificial intelligence has forced Australian VET into a defining moment.

The old confidence that assessment integrity could be maintained through familiar methods, familiar assumptions and familiar controls is no longer enough. Authenticity can no longer be presumed. Written evidence can no longer be trusted as casually as before. Detection tools will not rescue a weak design. And policy silence will not protect providers from the consequences of ambiguity.

This is not a temporary disruption. It is a structural shift.

The sector now has a choice. It can treat AI as a problem to be managed at the edges while hoping existing systems hold together. Or it can use this moment to rethink assessment more deeply, strengthen authenticity, redesign evidence, clarify acceptable use, improve staff capability and rebuild confidence in what competence really looks like.

That is the challenge now facing every serious RTO.

The future of assessment integrity in Australian VET will not be defined by whether AI exists. It already does.

It will be defined by whether the sector is disciplined enough to design assessment systems that still deserve to be trusted.