Capitalize string of words except for prepositional words

Question

I am using the code below to take a string entered from a text box, and convert to capital case, except for words like (the, and, an, as, to, or, on) etc.

Issue #1: I want the first word of the string to always be capitalized regardless of what the word is.

Issue #2: The word spacing is not correct when the string is put back together.

xText = queryForHTML    
xTextSplit = split(xText, " ")

for each item in xTextSplit

    xWord = item

    if lcase(item) = "the" or lcase(item) = "and" or lcase(item) = "an" or lcase(item) = "as" or lcase(item) = "to" or lcase(item) = "is" or lcase(item) = "on" then
        xWord = lcase(item)
    end if

    xCompleteWord = xCompleteWord & " " & xWord

next

queryForHTML = xCompleteWord

MC ND · Accepted Answer

Option Explicit 

Dim originalString    
    originalString = "a saMple of String capiTalization (in some cases, not so obvious)"

Dim convertedString

Dim noiseWords
    noiseWords= "/a/abaft/aboard/about/above/absent/across/afore/after/against/along/alongside/amid" + _ 
                "/amidst/among/amongst/an/anenst/apropos/apud/around/as/aside/astride/at/athwart/atop" + _ 
                "/barring/before/behind/below/beneath/beside/besides/between/beyond/but/by/circa" + _ 
                "/concerning/despite/down/during/except/excluding/failing/following/for/forenenst/from" + _ 
                "/given/in/including/inside/into/like/mid/midst/minus/modulo/near/next/notwithstanding" + _ 
                "/o/of/off/on/onto/opposite/or/out/outside/over/pace/past/per/plus/pro/qua/regarding" + _ 
                "/round/sans/save/since/so/than/through/thru/throughout/thruout/till/times/to/toward" + _ 
                "/towards/under/underneath/unlike/until/unto/up/upon/versus/vs/via/vice/vis/with/within" + _ 
                "/without/worth/this/"

    Function correctCase(matchString,word,position,sourceString)
        word = LCase(word)
        If (position > 0) And (InStr(noiseWords,"/" & word & "/")>0) Then 
            correctCase = word
        Else
            correctCase = UCase(Left(word,1)) & Mid(word,2,Len(word)-1)
        End If
    End Function 

    With New RegExp
        .Pattern = "(\w+)"
        .Global = True 
        .IgnoreCase = True
        convertedString = .Replace(originalString,GetRef("correctCase"))
    End With 

    WScript.Echo originalString
    WScript.Echo convertedString

The basic idea is to use a regular expression matching any sequence of "word" characters ([a-zA-Z0-9]) and for each sequence, a function is called which receives as parameters the string matches, the capture group containing the word, the position in the string where it has been found and the full source string.

If the word is at position 0 it is capitalized. If the word is a "noise" word, it is lowercased, else, the word is capitalized.

Capitalize string of words except for prepositional words

Answers (2)

Related Questions