Add AI Metadata to Your Lightroom Catalog
Use Lightroom mode to round-trip captions and keywords back into your catalog
Try PhotoScanr FreeFree to use • No sign-up required • Instant results
By Duncan Rawlinson · Updated
Lightroom users have lived through a few generations of "AI keywording." The first wave was a feature inside the catalog that mostly returned generic single-word tags. The second wave was external services that uploaded your previews to a black box and returned a CSV. The 2026 generation is finally the one worth using, and it changes how a working photographer can think about a 200,000-image catalog.
This guide is a practical look at the state of AI photo metadata for Lightroom users in 2026. It covers what current AI keywording can and cannot do, the difference between catalog metadata and IPTC metadata embedded in the file, the LR Transporter workflow that has become the default for round-tripping AI captions back into Lightroom, when to trust AI output and when to override it, and the recent Adobe Sensei changes that affect what you should and should not duplicate yourself.
If you have a backlog catalog full of "Untitled" and "DSC_0042," there has never been a better moment to fix it.
The current generation of multimodal models is genuinely good at describing the contents of a photograph. They get the subject, the setting, the activity, the dominant colors, the rough time of day, and often the geographic region. They write in complete sentences rather than dumping bag-of-keywords output. They handle multiple subjects in the same frame.
What they still get wrong is anything requiring outside knowledge that is not in the image. They cannot reliably guess the names of people unless given a face library. They will sometimes confidently invent location specifics. They cannot tell you which lens you used or what shutter speed produced a given motion blur unless EXIF is part of the input.
Knowing this, the right framing for 2026 is "AI does the description, photographer does the proper nouns." That division of labor is what makes the workflow fast.
Lightroom Classic remains the workhorse for serious archives, but its catalog has well-known limits worth respecting. Catalogs over a million images get sluggish on most hardware. Smart previews and 1:1 previews balloon disk use. The catalog file itself is a SQLite database and benefits from periodic optimization.
Practically, this means do not pour AI keywords into a single catalog all at once. Process in chunks of a few thousand images, write metadata back to the files, and let Lightroom re-read XMP rather than holding everything in working memory. The metadata browser becomes useful again once your keywords are populated, but it falls over if you try to populate everything in one operation.
Two settings worth checking. Make sure "Automatically write changes into XMP" is on, so the IPTC metadata you build in the catalog actually gets embedded in the files. And periodically run "Optimize Catalog" from the File menu to keep the SQLite database trim.
This is the distinction that trips up the most users. There are two places metadata can live for a Lightroom photo, and they are not the same place.
Stored in the .lrcat database. Fast to read and search inside Lightroom. Lost if the catalog is corrupted or if you switch to a new catalog without exporting metadata first. Not visible to other applications.
Embedded in the JPEG, TIFF, or accompanying XMP sidecar for raw files. Readable by Bridge, Photo Mechanic, Capture One, every CMS, and the Google image index. Travels with the file. Survives catalog rebuilds.
For AI metadata to be genuinely useful long-term, it has to make it into the file, not just the catalog. Anything that exists only in the catalog will not survive your next migration.
The Lightroom Transporter pattern has become the default way to get AI metadata into Lightroom in 2026. The shape is simple. Export JPEGs from a Lightroom collection. Run them through a metadata service. Re-import the metadata, not the JPEGs themselves, back into the original catalog entries.
PhotoScanr's Lightroom mode is built around this pattern. You export a collection at small or medium JPEG quality, since the AI just needs to see the image and full resolution wastes upload time. You drop the folder into PhotoScanr, run a batch, and download a ZIP with captions, titles, and keywords written into the JPEG IPTC headers. You then use Lightroom's "Read Metadata from Files" command on the originals to pull the keywords across.
There is one detail to watch. Lightroom matches by filename, so the AI-processed JPEGs need to keep their original filenames for the round trip to work. PhotoScanr's Lightroom mode preserves filenames by default. If you rename in your export preset, turn that off for the AI pass.
For raw files, the AI works on the JPEG export but the metadata gets pulled into the raw's XMP sidecar through Lightroom. This is the cleanest way to get AI keywords onto raw files without ever touching the raw itself.
The mental model that works best is to treat AI metadata as a strong first draft. Accept most of it. Override the small percentage that matters.
General descriptions. Subject identification. Color palette. Mood and atmosphere. Activity. Common location types like "beach" or "urban street." Generic keywords like "candid" or "portrait." These are the bulk of what you need and the AI is faster and more consistent than you would be.
Named people. Specific locations the AI cannot know. Project codes or client names that should be added as keywords. Copyright fields. Anything tied to a release, contract, or deliverable.
For trips and events where you know the locations and people in advance, pass that as grounding before the AI runs. This is more efficient than overriding after the fact. The AI uses your context as ground truth and you do not have to fix things one by one.
Adobe has continued to layer AI features into Lightroom. The People view, AI-powered subject and sky masking, and the search by visual similarity have all gotten more accurate. Lens Blur and Generative Remove are now standard.
Importantly, Adobe's Sensei keywording remains conservative. It is good at face recognition once you have trained it on a few people. It is good at common categories. It does not write rich captions and it does not generate alt text or SEO-friendly titles. That gap is exactly what external AI metadata services exist to fill.
The practical implication for 2026 is that Adobe handles the in-app organizational AI, while external services handle the descriptive and SEO-facing metadata. They are complementary, not competing. Use Sensei's face recognition to identify people, then add their names as grounding when you run a batch through PhotoScanr.
A dedicated mode that produces output sized for Lightroom round-trips. Captions, headlines, descriptions, and keyword sets are all written into IPTC fields that Lightroom recognizes. Filenames are preserved so the round trip works on import.
Save a style preset that matches how you describe your own work. Keywords stay in your terminology. Captions sound like your shop, not generic AI prose.
Pass trip itineraries, location names, and people lists as grounding before the batch runs. The AI uses these as ground truth.
PhotoScanr Pro handles 100 photos per day with batches up to 25, which fits incremental catalog cleanup. Studio handles 600 per day with batches of 100, which fits big back-catalog projects. The tier was renamed to Studio in v1.22.0; existing subscribers are unaffected.
For a one-time catalog cleanup of a back catalog with tens of thousands of images, top up with credits instead of upgrading your plan. 1,000 credits for $20, 5,000 for $90, or 10,000 for $160. Credits do not expire, so you can chip away at the catalog over months.
The 2026 version of AI photo metadata for Lightroom users finally works the way photographers wanted it to a decade ago. The AI handles the bulk descriptive work. You handle the proper nouns and the legal fields. Adobe Sensei handles the in-app intelligence. External services handle the SEO and IPTC. Everything writes back to the file so the metadata survives catalog rebuilds and migrations.
The biggest unlock is not speed, although speed is real. It is that a properly tagged archive becomes findable. You can search "foggy harbor at dawn 2019" and actually get the right photo, instead of scrolling through a folder named "Maine Trip" hoping to remember which evening it was.
For a step-by-step batch workflow that uses these ideas on a 1,000-image trip, see the photographer batch workflow guide. If you also do client deliverables, the event captions guide covers ZIP plus CSV handoffs.
Use Lightroom mode to round-trip captions and keywords back into your catalog
Try PhotoScanr FreeFree to use • No sign-up required • Instant results