I am going through a large website (1600+ pages) to make it pass Priority 1 W3C WAI. As a result, things like image tags need to have alt attributes.
What would be the
I wrote a simple code for this without Regex
let arr = []
$('img')
.filter(function() {
arr.push(this.alt)
})
document.write(arr.filter(a=>!a).length + ' img without alt tag')
Building on Mr.Black and Roberts126 answers:
/(<img(?!.*?alt=(['"]).*?\2)[^>]*)(>)/
This will match an img tag anywhere in the code which either has no alt tag or an alt tag which is not followed by ="" or ='' (i.e. invalid alt tags).
Breaking it down:
( : open capturing group
<img : match the opening of an img tag
(?! : open negative look-ahead
.*? : lazy some or none to match any character
alt=(['"]) : match an 'alt' attribute followed by ' or " (and remember which for later)
.*? : lazy some or none to match the value of the 'alt' attribute
\2) : back-reference to the ' or " matched earlier
[^>]* : match anything following the alt tag up to the closing '>' of the img tag
) : close capturing group
(>) : match the closing '>' of the img tag
If your code editor allows search and replace by Regex you can use this in combination with the replace string:
$1 alt=""$3
To find any alt-less img tags and append them with an empty alt tag. This is useful when using spacers or other layout images for HTML emails and the like.
Simple and effective:
<img((?!\salt=).)*?
This regex works for find <img>
tags missing the alt
attribute.
This is really tricky, because regular expressions are mostly about matching something that is there. With look-around trickery, you can do things like 'find A that is not preceded/followed by B', etc. But I think the most pragmatic solution for you wouldn't be that.
My proposal relies a little bit on your existing code not doing too crazy things, and you might have to fine-tune it, but I think it's a good shot, if you really want to use a RegEx-search for your problem.
So what I suggest would be to find all img tags, that can (but don't need to) have all valid attributes for an img-element. Whether that is an approach you can work with is for you to decide.
Proposal:
/<img\s*((src|align|border|height|hspace|ismap|longdesc|usemap|vspace|width|class|dir|lang|style|title|id)="[^"]"\s*)*\s*\/?>/
The current limitations are:
This worked for me.
^<img(?!.*alt).*$
This matches any string beginning with <img
that doesn't contain any number of characters before an alt attribute. It even works for src="<?php echo $imagename; ?>"
type of attributes.
This works in Eclipse:
<img(?!.*alt).*?>
I'm updating for Section 508 too!