I have the following Powershell script:
$oldCode = @\"
For Pete's sake, don't even think about using regex for HTML.
The problem you met is that reading a file will provide you an array of strings. Replace()
doesn't know about arrays, so you got to work it by hand. You could create a big string with -join
like so,
$fileContent = [System.Io.File]::ReadAllText($_.FullName)
$theOneString = $fileContent -join ' '
$theOneString.Replace($foo, $bar)
... But this will mess up your line breaks. Then again, you could reformat the string with HTML Tidy.
The manual way is to iterate the source array row by row. Until you find the <div>
, copy the contents into new destination array. After finding the replacable part, insert rest of the new stuff into the destination array. Keep reading and discarding the source array untill you find the </div>
and copy all the rest into the destination array. Finally save the destination array's contents and you are done.
I wouldn't use string replacements for modifying HTML code. To many things that could develop in unexpected directions. Try something like this:
$newCode = @"
<!-- nested divs and spans -->
<div id="contact-form">
<?php include "contact-form.php"; ?>
</div>
"@
Get-ChildItem '*.html' | % {
$html = New-Object -COM HTMLFile
$html.write([IO.File]::ReadAllText($_.FullName))
$html.getElementById('time_estimate').innerHTML = $newCode
[IO.File]::WriteAllText($_.FullName, $html.documentElement.outerHTML)
}
If needed you can can prettify the HTML by using Tidy:
$newCode = @"
<!-- nested divs and spans -->
<div id="contact-form">
<?php include "contact-form.php"; ?>
</div>
"@
[Reflection.Assembly]::LoadFile('C:\path\to\Tidy.dll') | Out-Null
$tidy = New-Object Tidy.DocumentClass
Get-ChildItem '*.html' | % {
$html = New-Object -COM HTMLFile
$html.write([IO.File]::ReadAllText($_.FullName))
$html.getElementById('time_estimate').innerHTML = $newCode
$tidy.ParseString($html.documentElement.outerHTML)
$tidy.SaveFile($_.FullName) | Out-Null
}
What version of PowerShell are you using? If you're using v3 or higher, try this:
ls *.html | foreach {
$fileContent = Get-Content $_.FullName -Raw
$newFileContent = $fileContent -replace $oldCode, $newCode
Set-Content -Path $_.FullName -Value $newFileContent
Write-Host "`r`n"
Write-Host "Processed - $($_.Name)...`r`n"
}