Exact identifier matching is fast and certain, but only when the identifier is present. Most catalogs are not that clean.
For products without a usable identifier, we embed titles and images, then give each candidate match a confidence score. Matches that score low go to a review queue rather than being accepted automatically.
Confirmed matches become training signal, so the pipeline gets sharper the longer it runs.
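
The scoring and routing step can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the actual pipeline: it assumes title and image embeddings are already computed as plain vectors, combines them with a weighted cosine similarity, and uses a fixed threshold to decide what needs review. The names `Match`, `best_match`, `title_weight`, and `review_below`, along with the values 0.6 and 0.85, are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class Match:
    candidate_id: str
    confidence: float
    needs_review: bool


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def best_match(query, catalog, title_weight=0.6, review_below=0.85):
    """Return the highest-scoring catalog entry for a query product.

    query:   (title_vec, image_vec) for the product being matched
    catalog: {candidate_id: (title_vec, image_vec)}
    The weight and threshold are illustrative assumptions, not tuned values.
    """
    best_id, best_score = None, -1.0
    for cid, (title_vec, image_vec) in catalog.items():
        score = (title_weight * cosine(query[0], title_vec)
                 + (1 - title_weight) * cosine(query[1], image_vec))
        if score > best_score:
            best_id, best_score = cid, score
    if best_id is None:
        return None  # empty catalog: nothing to match against
    # Anything under the threshold is flagged for the review queue.
    return Match(best_id, best_score, needs_review=best_score < review_below)


# Toy two-dimensional "embeddings" standing in for real model output.
catalog = {
    "sku-a": ([1.0, 0.0], [1.0, 0.0]),
    "sku-b": ([0.0, 1.0], [0.0, 1.0]),
}
confident = best_match(([1.0, 0.0], [1.0, 0.0]), catalog)
uncertain = best_match(([1.0, 1.0], [1.0, 0.0]), catalog)
```

In this toy data, `confident` matches `sku-a` outright, while `uncertain` also picks `sku-a` but falls under the threshold and is flagged for review. Reviewer decisions on flagged matches are the kind of confirmed signal that could later feed back into the weight or threshold.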