Search & Discovery
Editorial Research

By · Published · Updated

The Quiet Vocabulary Shift That Search Engines Are Now Using to Find Your Content

A close look at how Schema.org's expanding property library—including some newly codified terms—connects publishers, educators, and search systems in ways that weren't possible a few years ago.

Key Takeaways · Quick Answers
What is Schema.org and why does it matter for educational content discovery?
Schema.org is a collaborative vocabulary project launched by Google, Microsoft, Yahoo!, and Yandex in 2011 that provides standardized properties for marking up structured data on web pages. For educational content, it matters because it allows publishers to describe courses, books, instructors, and learning resources in a language search engines can parse and compare—increasing the likelihood that learners find relevant material when they search.
What is structured data and how does it work with search engines?
Structured data is code added to web pages that follows Schema.org vocabulary to describe content in machine-readable terms. When search engines crawl a page, they can read the structured data to understand exactly what the content is, who created it, and how it relates to other content. Yoast SEO's 18.7 release, documented in May 2022, illustrated how tools build full knowledge graphs using Schema vocabulary to connect content entities and potentially earn rich search results.
What does Schema.org Version 30.0 include?
Schema.org Version 30.0 was released in March 2026 and represents the latest stage of a vocabulary that has been expanding since 2011. The version includes properties across dozens of domains, from generic creative works to specialized types like CDCPMDRecord for public health data, Vehicle for automotive content, and VideoGame for interactive entertainment—each adding precision to how specific content types can be described and discovered.
How do Web APIs like SubmitEvent.submitter relate to educational platforms?
The SubmitEvent.submitter property, documented on MDN and standardized across browsers since March 2022, allows developers to identify which specific form element triggered a submission. Educational platforms use forms for enrollment, quizzes, feedback, and assignment submissions. This API enables better analytics, improved accessibility tracking, and more precise measurement of learner engagement—all factors that contribute to platform quality and indirectly to discoverability.
Why are specialized Schema properties like cheatCode or numberOfAxles relevant to education?
These properties illustrate a broader principle: the more specific the vocabulary, the better it serves specialized audiences. Game-based learning, simulation-based training, and interactive educational content now have structured data options that describe their interactive components with precision. The presence of domain-specific properties signals that Schema.org is building vocabulary for how modern learning actually happens—increasingly interactive and embedded in digital environments rather than linear texts.

There is a moment every educator and publisher eventually confronts: the knowledge exists, the content is good, and yet the people who need it cannot seem to find it. The problem is rarely the material itself. It is the translation layer between what humans write and what machines read.

That translation layer has a name. It is called structured data, and at its center sits a vocabulary project called Schema.org—a collaborative initiative between Google, Microsoft, Yahoo!, and Yandex that began in 2011 and has since grown into the dominant standard for describing web content in ways search engines can parse, compare, and surface.

The vocabulary is not static. It expands. Properties are proposed, debated, adopted, and refined by a community of web developers, standards organizations, and publishers. And in recent years—culminating with the March 2026 release of Schema.org Version 30.0—the breadth of what this language can describe has widened in ways that are directly relevant to anyone publishing educational material online.

What Schema.org Actually Is

At its core, Schema.org provides a shared vocabulary for marking up structured data on web pages. When a publisher adds Schema markup to an article about a historical event, a course offering, or an instructional video, they are essentially handing search engines a machine-readable summary of what that content is, who created it, and how it relates to other things in the world.

The vocabulary organizes around types—Thing, CreativeWork, Course, Book, Event—and properties that describe specific attributes of those types. A Course might have properties like description, provider, hasCourseInstance, and educationalLevel. A Book might carry author, isbn, numberOfPages, and about.

This is not new. What is new is how granular the vocabulary has become—and how the expansion into specialized domains is creating sharper pathways for educational content to be recognized, categorized, and delivered to the right readers at the right moment.

The Property Landscape: Three Recent Additions

Schema.org Version 30.0, released in March 2026, reflects years of incremental growth. To understand what the vocabulary now enables, it helps to look at three properties from recent releases—each drawn from different domains but each illustrating the same underlying principle: the more precisely a machine can understand what something is, the better it can connect it to the person looking for it.

cheatCode: When Play Is Also Learning

The cheatCode property, documented in the Schema.org development version, describes cheat codes associated with video game content. It is expected to be a value of type CreativeWork and is used on types including VideoGame and VideoGameSeries. Schema.org's cheatCode property documentation notes that these represent "cheat codes to the game."

On the surface, this seems far from educational content. But consider the implications. Video games are increasingly used in educational settings—game-based learning, simulation-based training, interactive history lessons. The fact that Schema.org now has a vocabulary for describing interactive elements within game-based content means that educational game developers can mark up not just that a piece of content exists, but how it functions internally.

A training simulation for emergency responders, a historical strategy game used in a classroom, or a language-learning app built on game mechanics—all of these could now carry structured data that describes their interactive components with precision. The cheatCode property is a small signal of a larger trend: Schema.org is building vocabulary for the way modern learning actually happens, which is increasingly interactive, game-informed, and embedded in digital environments rather than linear texts.

cvdCollectionDate: Public Health Data and the Vocabulary of Trust

The cvdCollectionDate property is used within the CDCPMDRecord type to represent collection dates for patient count data. Its values are expected to be DateTime or Text, and it lives in what Schema.org describes as a "new" area—meaning implementation feedback is still being gathered and adoption is actively evolving. Schema.org's cvdCollectionDate property documentation describes it as related to "collectiondate—Date for which patient counts are reported."

This property is narrow in scope—it applies to a very specific record type within public health data. But its presence in the Schema vocabulary illustrates something broader: the effort to bring structured data into domains where data provenance and temporal accuracy matter enormously. In educational contexts, this matters too. A course syllabus marked with accurate session dates, a research dataset with clearly defined collection periods, or a historical archive with precise timelines—these all benefit from the same impulse that drove cvdCollectionDate into the vocabulary.

The connection to educational content discovery may not be obvious at first. But consider how search engines evaluate credibility. When a search engine can confirm not just that educational content exists, but that it is anchored to verifiable data with clear timestamps and authoritative sources, that content earns trust signals that raw keyword matching cannot provide.

numberOfAxles: Domain Specificity as Discovery Infrastructure

The numberOfAxles property, used on the Vehicle type, describes how many axles a vehicle has. It accepts values as Number or QuantitativeValue and draws on the Automotive Ontology Working Group for its definitions. Schema.org's numberOfAxles property documentation notes that typical unit code is C62, with acknowledgements to the Automotive Ontology Working Group.

This is a deeply specific property. It speaks to a vehicle, not a learner. It is not about education at all. But its existence in Schema.org illustrates a principle that applies to educational content: the more specialized a vocabulary becomes, the better it can serve specialized audiences. The Automotive Ontology Working Group contributed domain expertise to ensure that automotive data could be described with precision. That same model—domain experts contributing to Schema vocabulary in their areas of specialization—is precisely what educational publishers, learning scientists, and instructional designers need to engage with if they want structured data to reflect the nuance of how learning happens.

Vocabulary specificity is discovery infrastructure. When Schema.org had only generic CreativeWork for everything, search engines had to infer what a piece of content was. Now, with properties that describe axles, collection dates, and game codes, the inference is replaced by declaration. For educational content, this means the gap between "a PDF about biology" and "a peer-reviewed interactive unit on cellular respiration for 9th grade" can finally be bridged with structured data.

How Structured Data Reaches Search: The Yoast Connection

Schema.org is the vocabulary. But vocabulary only matters if it is used. That is where tools like Yoast SEO come in. In May 2022, Yoast released version 18.7 of their WordPress SEO plugin, a release focused heavily on Schema structured data improvements. Yoast's changelog for SEO 18.7 described how structured data helps publishers "talk directly to search engines in a language it understands."

The changelog noted: "Schema enables you to describe all those different parts and entities and how all of these parts connect." That is a quiet but significant reframing. Schema is not just about getting a rich snippet. It is about building a graph—a connected representation of how content, authors, organizations, images, and related concepts relate to one another.

For educational publishers, this graph is the discovery layer. When Yoast SEO builds a full structured data graph for a site—connecting course pages to instructor profiles to book reviews to video content—it is constructing a machine-readable map of everything the publisher offers. Search engines can traverse that map. They can understand that a particular video belongs to a specific course taught by a specific educator, which is part of a larger program, which was reviewed in a specific publication.

The Yoast release also improved how images are marked up for Person and Organization types, including the logo attribute for organizations. This matters for educational publishers because author identity and institutional affiliation are critical trust signals. A reader searching for "introduction to constitutional law" benefits when search engines can connect that course to a recognized instructor, a known institution, and verified credentials—all surfaced through structured data.

The Browser Layer: Web APIs and Structured Data

Beneath the Schema vocabulary and the SEO tools sits another layer that shapes how educational content is experienced and discovered: the browser itself. Web APIs—programming interfaces built into browsers—handle the mechanics of how content is displayed, interacted with, and transmitted.

Two MDN-documented properties illustrate this layer. The fullscreenElement read-only property of the ShadowRoot interface, as documented by MDN's documentation of ShadowRoot.fullscreenElement, "returns the element within the shadow tree that is currently displayed in full screen mode, or null if there is no full screen element." The documentation notes limited availability—this feature does not work in some of the most widely-used browsers.

The submitter property of the SubmitEvent interface, documented by MDN's SubmitEvent.submitter documentation, "specifies the submit button or other element that was invoked to cause the form to be submitted." It is baseline—widely available across devices and browser versions since March 2022.

These APIs may seem distant from educational content discovery. But consider: educational platforms rely heavily on interactive forms—enrollment applications, quizzes, feedback surveys, assignment submissions. The SubmitEvent.submitter property, now standardized across browsers, allows developers to track exactly which interactive element triggered a submission, enabling better analytics, improved accessibility, and more precise measurement of learner engagement. The ShadowRoot.fullscreenElement property, meanwhile, powers immersive learning experiences—video lectures, document viewers, interactive simulations—that can expand to full-screen mode while maintaining component isolation.

These are not discovery features in the traditional sense. But they are experience features. And in an era where search engines increasingly factor user experience signals into rankings, the technical foundations that enable smooth, accessible, well-structured interactive learning experiences are themselves part of the discovery equation.

What This Means for Educational Publishers

The cumulative picture is this: Schema.org's vocabulary is expanding to cover more specific domains, major publishing tools like Yoast SEO are building structured data graphs that connect educational content to its context, and browser APIs are enabling richer interactive experiences that keep learners engaged—all of which feeds into how search engines evaluate, rank, and surface educational material.

For publishers and educators, the practical implication is straightforward. Structured data is no longer optional or merely technical. It is the infrastructure through which educational content declares its identity to the systems that learners use to find it. A course without Schema markup is a book without a title page in a library that no one can alphabetize.

Schema.org Version 30.0, with its March 2026 release date, represents a vocabulary that has grown sophisticated enough to describe not just that content exists, but how it functions, who made it, what it belongs to, and how it connects to other content. The properties examined here—cheatCode, cvdCollectionDate, numberOfAxles—each illustrate a domain where precision in description enables precision in discovery.

Educational content has historically been difficult to mark up precisely because learning is complex. A course is not just a document. It has sessions, instructors, prerequisites, learning objectives, assessments, and credentials. Schema.org is gradually building the vocabulary to describe that complexity. The question for publishers is whether they are using that vocabulary—and using it well enough that search engines can understand what they offer.

Why This Matters for ReadersOpinions Readers

If you are a reader looking for the best book on a subject, the most relevant course from an instructor you trust, or the most up-to-date framework for understanding a concept, structured data is part of what makes that search work. When you type a question into a search engine and receive a direct answer—a definition, a comparison table, a list of steps—that answer was almost certainly assembled using Schema markup.

For readers researching practitioners, frameworks, books, and ideas, understanding structured data is not about becoming a web developer. It is about understanding the infrastructure that connects the content you seek with the systems that surface it. Schema.org is one layer of that infrastructure, and it is growing more capable every release.

The expansion of properties into specialized domains—automotive, public health, gaming, interactive forms—signals a future where virtually any kind of knowledge can be described with enough precision for machines to help humans find it. Educational content, with its inherent complexity and its outsized importance to individual and societal outcomes, stands to benefit enormously from that expansion—if publishers continue to do the work of marking it up.

Where to Read Further

The vocabulary itself is publicly documented and freely available. The best place to start is the official Schema.org documentation, where properties are organized by type and use case, with implementation notes and community feedback visible for each entry. Version 30.0 reflects the current state of that ongoing conversation.

For a practitioner's view of how structured data connects content to search, the Yoast SEO 18.7 changelog remains a useful case study in how major publishing tools translate Schema vocabulary into searchable, graph-connected content.

For the browser-level APIs that power interactive educational experiences, MDN's documentation of SubmitEvent.submitter and MDN's documentation of ShadowRoot.fullscreenElement offer clear, technical explanations with compatibility notes and code examples.

These sources together illustrate that structured data is not a single tool or a single standard. It is an ecosystem—a vocabulary, a set of implementation tools, a browser layer, and a community of publishers—all working toward the same goal: helping the right content reach the right reader at the right moment.

Sources reviewed

Atlas Research Network