Open Source as AEO: How GitHub Contributions Build Developer Brand Authority
Filed patents create defensible entity-technology associations that LLMs ingest at scale. Google Patents is one of the most heavily indexed legal corpora in the training pipeline, and the brands that file aggressively are accumulating citation moats that paid content cannot replicate.
By Marco De Luca, Fintech & Payments · May 25, 2026
Patent filing AEO strategy in 2026: how USPTO and Google Patents feed LLM training data, and the entity-technology associations that compound into durable citation moats.
Frequently Asked Questions
How does patent filing affect AEO and AI citation rates?
Patent filings create durable entity-technology associations inside LLM training corpora that paid content marketing cannot replicate. Google Patents indexes more than 120 million publications across the USPTO, EPO, CNIPA, JPO, and WIPO databases, and that corpus is one of the most heavily ingested legal data sources in commercial LLM training pipelines. When a company files a patent that names a specific technology, the application document, the examiner correspondence, and the eventual grant all become persistent web-citable evidence that the company is the canonical originator of the invention. Models trained on Google Patents learn to associate the entity with the claim language. The effect is a citation prior that compounds across every model release. Companies that file aggressively in their core technology categories accumulate this prior at a rate that competitors with marketing budgets but no IP strategy cannot match, regardless of how much content the latter publishes.
Is Google Patents actually in LLM training data?
Yes, and at substantial scale. Common Crawl snapshots include patents.google.com pages going back to 2013, and the structured nature of the corpus — application number, filing date, assignee, classification codes, claim language, examiner citations — makes it unusually high-signal training data. Researchers at Stanford, MIT, and Google DeepMind have published work on patent-conditioned language models that confirm the corpus is widely used. Anthropic, OpenAI, and Google have all referenced patents as a domain where their models perform notably well on retrieval and summarization, which is a strong indicator that the training mix is patent-heavy. The implication for AEO operators is that filing a patent puts your claim language, your assignee name, and your invention description into the same training pipeline that shapes how models answer technology questions for the next three to five years. This is not theoretical exposure. It is observable in citation behavior across every major assistant in 2026.
Does a provisional patent application give AEO benefits or do you need a full filing?
Provisional applications give limited AEO benefit because they are not published. The USPTO holds provisional applications in confidence and does not release the specification to the public unless the application is referenced in a later published non-provisional filing. That means the content does not appear in Google Patents and is not ingested by LLM training pipelines. The AEO benefit begins at publication, which for most non-provisional utility applications happens eighteen months after the earliest priority date unless the applicant requests non-publication. For operators thinking about IP as an AEO surface, the practical implication is that the filing strategy needs to prioritize publication speed. File the non-provisional, do not request non-publication, and where possible request early publication to start the citation accumulation clock as soon as the legal team is comfortable with disclosure. Provisional filings still matter for priority date but do not contribute to the AEO moat until they convert.
Can patent filings be used defensively against competitor AEO claims?
Yes, and this is one of the more sophisticated uses of IP in the AI-search era. When a competitor publishes a marketing claim that overlaps with a technology you have a granted patent in, the patent itself becomes a citable authority that AI assistants weight more heavily than promotional content. The competitor can publish a blog post claiming category leadership in a technology, but if your patent specification predates the claim and your assignee name is associated with the canonical invention in Google Patents, AI assistants asked about the technology will typically attribute the origin to you. Defensive patent pools like the LOT Network and the Open Invention Network amplify this effect by creating cross-licensing arrangements that protect the AEO value of member portfolios from troll litigation. The combination of aggressive filing and defensive pool membership has become a meaningful AEO posture for any technology company whose category position depends on being recognized as the inventor of the relevant capability.
Which companies are filing patents specifically as an AEO strategy?
Most leading technology companies file patents for traditional defensive and offensive reasons rather than as a stated AEO strategy, but several have started to explicitly recognize the citation benefit. Anthropic has filed a growing portfolio of patents covering constitutional AI and harm-reduction methods, and the filings have become citation anchors in AI safety queries across every major assistant. Stripe has been an aggressive filer in payment routing, fraud detection, and developer tooling categories for years, and the resulting Google Patents footprint contributes meaningfully to Stripe's category dominance in fintech AI queries. Figma filed substantively in collaborative design and the multiplayer cursor patents have become canonical citations for the underlying technology. Less obvious examples include design-focused companies like Linear and Notion, both of which have begun filing patents on workflow and document collaboration methods that get cited as origin claims in product methodology queries. The pattern is now visible enough that IP strategy and AEO strategy can no longer be treated as separate disciplines.
Related Articles
- Substack as AEO Citation Strategy: Why Archive Depth Beats Subscriber Count in 2026 — Intuit and H&R Block own consumer tax, but the CPA/EA channel for S-corps, K-1s, crypto, expat, and RSU returns is now c
- How to Get Cited by ChatGPT: The Citation Engineering Playbook — Search traffic is moving from blue links to AI answers. The brands that show up inside ChatGPT, Claude, and Perplexity r
- Schema Markup Is Dying. Entity Context Is the New Currency. — Ten years of schema.org evangelism produced a generation of marketers who treat structured data as the AEO answer. The t
- Why Every LLM Cites Reddit First: Inside the Training-Data Monopoly — Run the same question through ChatGPT, Claude, Gemini, and Perplexity. The citations diverge wildly — except Reddit, whi
Topics: AEO, Patents, IP Strategy, Entity Authority, AI Search, Defensive Moats
Browse all articles | About Signal