Remove Duplicate XML Nodes in PowerShell

Question

I have an XML file that looks like the following:



  
    MozillaFirefox
    31.3.0
    /Mozilla/Firefox/31.3.0.exe
  
    GoogleChrome
    35.7
    /Google/Chrome/35.7.msi
  
    MozillaFirefox
    33.4.0
    /Mozilla/Firefox/33.4.0.exe

Here is my current code:

#Load XML file into $catalogXML
[xml]$catalogXML = (Get-Content (C:	est.xml))

$softwareVersionsArray = $catalogXML.catalog.software

Which outputs this:

name             version      installer_location
----             -------      ------------------ 
MozillaFirefox   31.3.0       /Mozilla/Firefox/31.3.0.exe
GoogleChrome     35.7         /Google/Chrome/35.7.msi
MozillaFirefox   33.4.0       /Mozilla/Firefox/33.4.0.exe

I need assistance coding this so that any duplicates are removed and that the first entry is the one that is kept (i.e. Firefox 31.3.0 is displayed and Firefox 33.3.4 is removed). I've tried various XPath statements and Select -Unique filters to no avail. Thanks in advance for any help!

user4003407 · Accepted Answer

To get desired results, you can group elements by name: Group-Object name; and than select first element from each group: ForEach-Object {$_.Group[0]}.

$catalogXML.catalog.software|
Group-Object name|
ForEach-Object {$_.Group[0]}

Remove Duplicate XML Nodes in PowerShell

Answers (2)

Related Questions