Boris Callens

Reputation: 93357

Case insensitive 'Contains(string)'

Is there a way to make the following return true?

string title = "ASTRINGTOTEST";
title.Contains("string");

There doesn't seem to be an overload that allows me to set the case sensitivity. Currently I UPPERCASE them both, but that's just silly (by which I am referring to the i18n issues that come with up- and down casing).

UPDATE

This question is ancient and since then I have realized I asked for a simple answer for a really vast and difficult topic if you care to investigate it fully.

For most cases, in monolingual English code bases, this answer will suffice. I suspect that because most people coming here fall into this category, this is the most popular answer.

This answer, however, brings up the inherent problem that we can't compare text case-insensitively until we know both texts are in the same culture and we know what that culture is. This is perhaps a less popular answer, but I think it is more correct, and that's why I marked it as such.

Upvotes: 3480

Views: 1291400

Answers (27)

Fabian Bigler

Reputation: 10915

OrdinalIgnoreCase, CurrentCultureIgnoreCase or InvariantCultureIgnoreCase?

Here are some recommendations about when to use which one:

Dos

  • Use StringComparison.OrdinalIgnoreCase for comparisons as your safe default for culture-agnostic string matching.
  • Use StringComparison.OrdinalIgnoreCase comparisons for increased speed.
  • Use StringComparison.CurrentCulture-based string operations when displaying the output to the user.
  • Switch current use of string operations based on the invariant culture to use the non-linguistic StringComparison.Ordinal or StringComparison.OrdinalIgnoreCase when the comparison is linguistically irrelevant (symbolic, for example).
  • Use ToUpperInvariant rather than ToLowerInvariant when normalizing strings for comparison.

Don'ts

  • Use overloads for string operations that don't explicitly or implicitly specify the string comparison mechanism.
  • Use StringComparison.InvariantCulture-based string operations in most cases; one of the few exceptions would be persisting linguistically meaningful but culturally-agnostic data.

Based on these rules you should use:

string title = "STRING";
if (title.IndexOf("string", 0, StringComparison.[YourDecision]) != -1)
{
    // The string exists in the original
}

where [YourDecision] depends on the recommendations above.

Upvotes: 37

Mohamed Salah

Reputation: 328

Newer versions of .NET have a Contains overload that can ignore case:

examplestring.Contains("exampleSTRING", StringComparison.OrdinalIgnoreCase)

Upvotes: 3

cabiste

Reputation: 517

Well, I came across this post, so I decided to benchmark some of the popular answers. In short, JaredPar's answer is the fastest, with zero memory allocation, and Colonel Panic's answer is the slowest.

Benchmark Info

  • Date: 2023/05
  • .NET SDK: 7.0.103
  • BenchmarkDotNet: 0.13.5
  • CPU: i3-8100
  • OS: Arch Linux

Code Used

using System;
using System.Globalization;
using BenchmarkDotNet.Attributes;

[MemoryDiagnoser]
public class StringContains
{
    [Params("How to install Arch Linux?")]
    public string Phrase { get; set; }
    [Params("How to", "arch", "blazor", "random long string to see if it effects the time needed")]
    public string search { get; set; }

    [Benchmark(Baseline = true)]
    public bool Contains() =>
        Phrase.Contains(search, System.StringComparison.CurrentCultureIgnoreCase);

    [Benchmark]
    public bool toUpper() =>
        Phrase.ToUpper().Contains(search.ToUpper());

    [Benchmark]
    public bool toLower() =>
        Phrase.ToLower().Contains(search.ToLower());

    [Benchmark]
    public bool IndexeOf() =>
        Phrase.IndexOf(search, StringComparison.OrdinalIgnoreCase) >= 0;

    [Benchmark]
    public bool CultureCompareInfo()
    {
        var culture = new CultureInfo("en-US");
        return culture.CompareInfo.IndexOf(Phrase, search, CompareOptions.IgnoreCase) >= 0;
    }
}

Results

I deleted some columns because they don't really matter.

| Method             | Phrase               | search               | Mean       | Ratio | Gen0   | Allocated |
|--------------------|----------------------|----------------------|------------|-------|--------|-----------|
| Contains           | How t(...)inux? [26] | How to               | 46.887 ns  | 1.00  | -      | -         |
| toUpper            | How t(...)inux? [26] | How to               | 88.386 ns  | 1.89  | 0.0381 | 120 B     |
| toLower            | How t(...)inux? [26] | How to               | 87.196 ns  | 1.86  | 0.0381 | 120 B     |
| IndexeOf           | How t(...)inux? [26] | How to               | 19.730 ns  | 0.42  | -      | -         |
| CultureCompareInfo | How t(...)inux? [26] | How to               | 166.691 ns | 3.56  | 0.0560 | 176 B     |
| Contains           | How t(...)inux? [26] | arch                 | 98.794 ns  | 1.00  | -      | -         |
| toUpper            | How t(...)inux? [26] | arch                 | 86.692 ns  | 0.88  | 0.0356 | 112 B     |
| toLower            | How t(...)inux? [26] | arch                 | 70.534 ns  | 0.71  | 0.0254 | 80 B      |
| IndexeOf           | How t(...)inux? [26] | arch                 | 26.405 ns  | 0.27  | -      | -         |
| CultureCompareInfo | How t(...)inux? [26] | arch                 | 219.527 ns | 2.22  | 0.0560 | 176 B     |
| Contains           | How t(...)inux? [26] | blazor               | 118.889 ns | 1.00  | -      | -         |
| toUpper            | How t(...)inux? [26] | blazor               | 83.605 ns  | 0.70  | 0.0381 | 120 B     |
| toLower            | How t(...)inux? [26] | blazor               | 67.559 ns  | 0.57  | 0.0254 | 80 B      |
| IndexeOf           | How t(...)inux? [26] | blazor               | 13.209 ns  | 0.11  | -      | -         |
| CultureCompareInfo | How t(...)inux? [26] | blazor               | 229.810 ns | 1.93  | 0.0560 | 176 B     |
| Contains           | How t(...)inux? [26] | rando(...)eeded [55] | 95.442 ns  | 1.00  | -      | -         |
| toUpper            | How t(...)inux? [26] | rando(...)eeded [55] | 113.243 ns | 1.19  | 0.0688 | 216 B     |
| toLower            | How t(...)inux? [26] | rando(...)eeded [55] | 86.116 ns  | 0.90  | 0.0254 | 80 B      |
| IndexeOf           | How t(...)inux? [26] | rando(...)eeded [55] | 7.380 ns   | 0.08  | -      | -         |
| CultureCompareInfo | How t(...)inux? [26] | rando(...)eeded [55] | 217.331 ns | 2.28  | 0.0560 | 176 B     |

Legends

Phrase : Value of the 'Phrase' parameter

search : Value of the 'search' parameter

Mean : Arithmetic mean of all measurements

Ratio : Mean of the ratio distribution ([Current]/[Baseline])

Gen0 : GC Generation 0 collects per 1000 operations

Allocated : Allocated memory per single operation (managed only, inclusive, 1KB = 1024B)

1 ns : 1 Nanosecond (0.000000001 sec)

Upvotes: 5

mkchandler

Reputation: 4758

You can use IndexOf() like this:

string title = "STRING";

if (title.IndexOf("string", 0, StringComparison.OrdinalIgnoreCase) != -1)
{
    // The string exists in the original
}

Since 0 (zero) can be an index, you check against -1.

Microsoft .NET Documentation:

The zero-based index position of the value parameter from the start of the current instance if that string is found, or -1 if it is not. If value is Empty, the return value is startIndex.
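
To make the point about checking against -1 concrete, here is a minimal sketch of the three return values described in the quoted documentation. The variable names are only illustrative and are not part of the answer above.

```csharp
using System;

string title = "STRING";

// Match at the very start: the index is 0, so a "> 0" test would wrongly report "not found".
int atStart = title.IndexOf("str", StringComparison.OrdinalIgnoreCase);

// No match anywhere: IndexOf returns -1.
int missing = title.IndexOf("xyz", StringComparison.OrdinalIgnoreCase);

// Empty search value: the return value is startIndex (2 here).
int empty = title.IndexOf("", 2, StringComparison.OrdinalIgnoreCase);

Console.WriteLine($"{atStart} {missing} {empty}"); // prints "0 -1 2"
```

Either `!= -1` or `>= 0` is a correct "found" test; `> 0` is not.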

Upvotes: 279

dashrader

Reputation: 357

You can pass a StringComparison parameter to the String.Contains method (available from .NET Core 2.1 and above).

public bool Contains (string value, StringComparison comparisonType);

Example:

string title = "ASTRINGTOTEST";
title.Contains("string", StringComparison.InvariantCultureIgnoreCase);

Upvotes: 15

Ben

Reputation: 35653

The several top-rated answers are all good and correct in their own ways; I write here to add more information, context, and perspective.

For clarity, let us consider that string A contains string B if there is any contiguous run of codepoints in A which is equal to B. If we accept this, the problem is reduced to the question of whether two strings are equal.

The question of when strings are equal has been considered in detail for many decades. Much of the present state of knowledge is encapsulated in SQL collations. Unicode normal forms are close to a proper subset of this. But there is more beyond even SQL collations.

For example, in SQL collations, you can be

  • Strictly binary sensitive - so that different Unicode normalisation forms (e.g. precombined or combining accents) compare differently.

    For example, é can be represented as either U+00e9 (precombined) or U+0065 U+0301 (e with combining acute accent).

    Are these the same or different?

  • Unicode normalised - In this case the above examples would be equal to each other, but not to É or e.

  • Accent insensitive (for e.g. Spanish, German, Swedish etc. text). In this case U+0065 = U+0065 U+0301 = U+00e9 = é = e

  • Case and accent insensitive (for e.g. Spanish, German, Swedish etc. text). In this case U+00e9 = U+0065 U+0301 = U+00c9 = U+0045 U+0301 = U+0045 = U+0065 = E = e = É = é

  • Kanatype sensitive or insensitive, i.e. you can consider Japanese Hiragana and Katakana as equivalent or different. The two syllabaries contain the same number of characters, organised and pronounced in (mostly) the same way, but written differently and used for different purposes. For example, katakana are used for loan words or foreign names, but hiragana are used for children's books, pronunciation guides (e.g. rubies), and where there is no kanji for a word (or perhaps where the writer does not know the kanji, or thinks the reader may not know it).

  • Full-width or half-width sensitive - Japanese encodings include two representations of some characters for historical reasons - they were displayed at different sizes.

  • Ligatures considered equivalent or not: See https://en.wikipedia.org/wiki/Ligature_(writing)

    Is æ the same as ae or not? They have different Unicode encodings, as do accented characters, but unlike accented characters they also look different.

    Which brings us to...

  • Arabic presentation form equivalence

    Arabic writing has a culture of beautiful calligraphy, where particular sequences of adjacent letters have specific representations. Many of these have been encoded in the Unicode standard. I don't fully understand the rules, but they seem to me to be analogous to ligatures.

  • Other scripts and systems: I have no knowledge whatsoever of Kannada, Malayalam, Sinhala, Thai, Gujarati, Tibetan, or almost all of the tens or hundreds of scripts not mentioned. I assume they have similar issues for the programmer, and given the number of issues mentioned so far and for so few scripts, they probably also have additional issues the programmer ought to consider.
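
A few of the distinctions in this list can be observed directly in .NET. The following is only a sketch: the sample strings are made up, and the last check depends on the culture data available to the runtime, so no result is claimed for it here.

```csharp
using System;
using System.Globalization;
using System.Text;

string precomposed = "\u00e9";  // é as a single code point
string combining = "e\u0301";   // e followed by a combining acute accent

// Strictly binary: the two code point sequences differ.
bool ordinalEqual = string.Equals(precomposed, combining, StringComparison.Ordinal);

// Unicode normalised: both sides are converted to NFC before comparing.
bool normalisedEqual =
    precomposed.Normalize(NormalizationForm.FormC) ==
    combining.Normalize(NormalizationForm.FormC);

// Case and accent insensitive: IgnoreNonSpace asks the comparer to ignore combining marks.
CompareInfo compareInfo = CultureInfo.InvariantCulture.CompareInfo;
bool looseContains = compareInfo.IndexOf(
    "Caf\u00e9 au lait", "CAFE",
    CompareOptions.IgnoreCase | CompareOptions.IgnoreNonSpace) >= 0;

Console.WriteLine($"ordinal: {ordinalEqual}, normalised: {normalisedEqual}, loose: {looseContains}");
```

The first comparison is false and the second is true. CompareOptions also has IgnoreKanaType and IgnoreWidth flags for the kana and width distinctions mentioned above.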

That gets us out of the "encoding" weeds.

Now we must enter the "meaning" weeds.

  • Is Beijing equal to 北京? If not, is Běijīng equal to 北京? If not, why not? It is the Pinyin romanisation.

  • Is Peking equal to 北京? If not, why not? It is the older postal romanisation (Wade-Giles would give Pei-ching).

  • Is Beijing equal to Peking? If not, why not?

Why are you doing this anyway?

For example, if you want to know if it is possible that two strings (A and B) refer to the same geographical location, or same person, you might want to ask:

  • Could these strings be either Wade-Giles or Pinyin representations of a set of sequences of Chinese characters? If so, is there any overlap between the corresponding sets?

  • Could one of these strings be a Cyrillic transcription of a Chinese Character?

  • Could one of these strings be a Cyrillic transliteration of the Pinyin romanisation?

  • Could one of these strings be a Cyrillic transliteration of a Pinyin romanisation of a Sinification of an English name?

Clearly these are difficult questions, which don't have firm answers, and in any case, the answer may be different according to the purpose of the question.

To finish with a concrete example.

  • If you are delivering a letter or parcel, clearly Beijing, Peking, Běijīng and 北京 are all equal. For that purpose, they are all equally good. No doubt the Chinese post offices recognise many other options, such as Pékin in French, Pequim in Portuguese, Bắc Kinh in Vietnamese, and Бээжин in Mongolian.

Words do not have fixed meanings.

Words are tools we use to navigate the world, to accomplish our tasks, and to communicate with other people.

While it looks like it would be helpful if words like equality, Beijing, or meaning had fixed meanings, the sad fact is they do not.

Yet we seem to muddle along somehow.

TL;DR: If you are dealing with questions relating to reality, in all its nebulosity (cloudiness, uncertainty, lack of clear boundaries), there are basically three possible answers to every question:

  • Probably
  • Probably not
  • Maybe

Upvotes: 5

Mathieu Renda

Reputation: 15356

.NET Core 2.0+ (including .NET 5.0+)

.NET Core has had a pair of methods to deal with this since version 2.0:

  • String.Contains(Char, StringComparison)
  • String.Contains(String, StringComparison)

Example:

"Test".Contains("test", System.StringComparison.CurrentCultureIgnoreCase);

It is now officially part of the .NET Standard 2.1, and therefore part of all the implementations of the Base Class Library that implement this version of the standard (or a higher one).

Upvotes: 228

JaredPar

Reputation: 755307

You could use the String.IndexOf Method and pass StringComparison.OrdinalIgnoreCase as the type of search to use:

string title = "STRING";
bool contains = title.IndexOf("string", StringComparison.OrdinalIgnoreCase) >= 0;

Even better is defining a new extension method for string:

public static class StringExtensions
{
    public static bool Contains(this string source, string toCheck, StringComparison comp)
    {
        return source?.IndexOf(toCheck, comp) >= 0;
    }
}

Note that the null-propagation operator ?. is available since C# 6.0 (VS 2015); for older versions use:

if (source == null) return false;
return source.IndexOf(toCheck, comp) >= 0;

USAGE:

string title = "STRING";
bool contains = title.Contains("string", StringComparison.OrdinalIgnoreCase);

Upvotes: 3165

Colonel Panic

Reputation: 137722

To test if the string paragraph contains the string word (thanks @QuarterMeister)

culture.CompareInfo.IndexOf(paragraph, word, CompareOptions.IgnoreCase) >= 0

Where culture is the instance of CultureInfo describing the language that the text is written in.

This solution is transparent about the definition of case-insensitivity, which is language dependent. For example, the English language uses the characters I and i for the upper and lower case versions of the ninth letter, whereas the Turkish language uses these characters for the eleventh and twelfth letters of its 29 letter-long alphabet. The Turkish upper case version of 'i' is the unfamiliar character 'İ'.

Thus the strings tin and TIN are the same word in English, but different words in Turkish. As I understand, one means 'spirit' and the other is an onomatopoeia word. (Turks, please correct me if I'm wrong, or suggest a better example)

To summarise, you can only answer the question 'are these two strings the same but in different cases' if you know what language the text is in. If you don't know, you'll have to take a punt. Given English's hegemony in software, you should probably resort to CultureInfo.InvariantCulture, because it will be wrong in familiar ways.
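
The tin/TIN example can be tried out with a short sketch like the following. The culture names en-US and tr-TR are my own choice for illustration, and the outcome depends on the culture data available to the runtime (for example, globalization-invariant mode treats all cultures alike), so treat it as an experiment rather than a guaranteed result.

```csharp
using System;
using System.Globalization;

string paragraph = "TIN";
string word = "tin";

// Two CultureInfo instances describing the language the text is assumed to be written in.
var english = new CultureInfo("en-US");
var turkish = new CultureInfo("tr-TR");

// Same call as in the answer, once per culture.
bool inEnglish = english.CompareInfo.IndexOf(paragraph, word, CompareOptions.IgnoreCase) >= 0;
bool inTurkish = turkish.CompareInfo.IndexOf(paragraph, word, CompareOptions.IgnoreCase) >= 0;

Console.WriteLine($"en-US: {inEnglish}, tr-TR: {inTurkish}");
```

With full culture data, the English comparison should find a match, while the Turkish one is expected not to, because Turkish casing pairs I with ı and İ with i.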

Upvotes: 1585

Valentin Peta

Reputation: 49

Based on the existing answers and on the documentation of Contains method I would recommend the creation of the following extension which also takes care of the corner cases:

public static class VStringExtensions 
{
    public static bool Contains(this string source, string toCheck, StringComparison comp) 
    {
        if (toCheck == null) 
        {
            throw new ArgumentNullException(nameof(toCheck));
        }

        if (source.Equals(string.Empty)) 
        {
            return false;
        }

        if (toCheck.Equals(string.Empty)) 
        {
            return true;
        }

        return source.IndexOf(toCheck, comp) >= 0;
    }
}

Upvotes: 0

Udi Y

Reputation: 288

Similar to previous answers (using an extension method) but with two simple null checks (C# 6.0 and above):

public static bool ContainsIgnoreCase(this string source, string substring)
{
    return source?.IndexOf(substring ?? "", StringComparison.OrdinalIgnoreCase) >= 0;
}

If source is null, return false (via null-propagation operator ?.)

If substring is null, treat as an empty string and return true (via null-coalescing operator ??)

The StringComparison can of course be sent as a parameter if needed.

Upvotes: 5

Pradeep Asanka

Reputation: 411

As simple and works

title.ToLower().Contains("String".ToLower())

Upvotes: 19

shaishav shukla

Reputation: 368

If you want to check whether one string is contained in another, there is a simple method for that.

string yourStringForCheck= "abc";
string stringInWhichWeCheck= "Test abc abc";

bool isContained = stringInWhichWeCheck.ToLower().IndexOf(yourStringForCheck.ToLower()) > -1;

The boolean is true if the string is contained and false otherwise.

Upvotes: 6

Christian Findlay

Reputation: 7712

Just to build on the answer here, you can create a string extension method to make this a little more user-friendly:

    public static bool ContainsIgnoreCase(this string paragraph, string word)
    {
        return CultureInfo.CurrentCulture.CompareInfo.IndexOf(paragraph, word, CompareOptions.IgnoreCase) >= 0;
    }

Upvotes: 8

Jed

Reputation: 10897

Alternative solution using Regex:

bool contains = Regex.IsMatch("StRiNG to search", Regex.Escape("string"), RegexOptions.IgnoreCase);
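
The Regex.Escape call matters whenever the search text may contain regex metacharacters. A small sketch with made-up sample strings shows the difference:

```csharp
using System;
using System.Text.RegularExpressions;

string text = "release-v1x0-notes";
string needle = "V1.0";

// Unescaped: '.' is a wildcard, so "V1.0" also matches "v1x0".
bool unescaped = Regex.IsMatch(text, needle, RegexOptions.IgnoreCase);

// Escaped: '.' is matched literally, so there is no match.
bool escaped = Regex.IsMatch(text, Regex.Escape(needle), RegexOptions.IgnoreCase);

Console.WriteLine($"{unescaped} {escaped}"); // prints "True False"
```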

Upvotes: 161

Lav Vishwakarma

Reputation: 1426

These are the easiest solutions.

  1. By IndexOf

    string title = "STRING";
    
    if (title.IndexOf("string", 0, StringComparison.CurrentCultureIgnoreCase) != -1)
    {
        // contains 
    }
    
  2. By Changing case

    string title = "STRING";
    
    bool contains = title.ToLower().Contains("string");
    
  3. By Regex

    Regex.IsMatch(title, "string", RegexOptions.IgnoreCase);
    

Upvotes: 26

takirala

Reputation: 2019

This is clean and simple.

Regex.IsMatch(file, Regex.Escape(fileNamestr), RegexOptions.IgnoreCase)

Upvotes: 35

Mr.B

Reputation: 3797

The trick here is to look for the string, ignoring case, but to keep it exactly the same (with the same case).

 var s="Factory Reset";
 var txt="reset";
 int first = s.IndexOf(txt, StringComparison.InvariantCultureIgnoreCase) + txt.Length;
 var subString = s.Substring(first - txt.Length, txt.Length);

Output is "Reset"
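
The snippet above assumes the text is found; if it is not, IndexOf returns -1 and Substring throws. A more defensive sketch follows. It swaps in OrdinalIgnoreCase (my substitution, not the answer's choice) because with culture-sensitive comparisons the matched span is not guaranteed to have the same length as the search text.

```csharp
using System;

string s = "Factory Reset";
string txt = "reset";

// Ordinal matching compares char by char, so a match always spans txt.Length chars.
int index = s.IndexOf(txt, StringComparison.OrdinalIgnoreCase);

// Only slice when the text was actually found.
string original = index >= 0 ? s.Substring(index, txt.Length) : null;

Console.WriteLine(original ?? "(not found)"); // prints "Reset"
```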

Upvotes: 3

Final Heaven

Reputation: 134

public static class StringExtension
{
    #region Public Methods

    public static bool ExContains(this string fullText, string value)
    {
        return ExIndexOf(fullText, value) > -1;
    }

    public static bool ExEquals(this string text, string textToCompare)
    {
        return text.Equals(textToCompare, StringComparison.OrdinalIgnoreCase);
    }

    public static bool ExHasAllEquals(this string text, params string[] textArgs)
    {
        for (int index = 0; index < textArgs.Length; index++)
            if (ExEquals(text, textArgs[index]) == false) return false;
        return true;
    }

    public static bool ExHasEquals(this string text, params string[] textArgs)
    {
        for (int index = 0; index < textArgs.Length; index++)
            if (ExEquals(text, textArgs[index])) return true;
        return false;
    }

    public static bool ExHasNoEquals(this string text, params string[] textArgs)
    {
        return ExHasEquals(text, textArgs) == false;
    }

    public static bool ExHasNotAllEquals(this string text, params string[] textArgs)
    {
        for (int index = 0; index < textArgs.Length; index++)
            if (ExEquals(text, textArgs[index])) return false;
        return true;
    }

    /// <summary>
    /// Reports the zero-based index of the first occurrence of the specified string
    /// in the current System.String object using StringComparison.OrdinalIgnoreCase.
    /// </summary>
    /// <param name="fullText">
    /// The string to search inside.
    /// </param>
    /// <param name="value">
    /// The string to seek.
    /// </param>
    /// <returns>
    /// The index position of the value parameter if that string is found, or -1 if it
    /// is not. If value is System.String.Empty, the return value is 0.
    /// </returns>
    /// <exception cref="ArgumentNullException">
    /// fullText or value is null.
    /// </exception>
    public static int ExIndexOf(this string fullText, string value)
    {
        return fullText.IndexOf(value, StringComparison.OrdinalIgnoreCase);
    }

    public static bool ExNotEquals(this string text, string textToCompare)
    {
        return ExEquals(text, textToCompare) == false;
    }

    #endregion Public Methods
}

Upvotes: 1

Tamilselvan K

Reputation: 1221

return "strcmpstring1".IndexOf("strcmpstring2", StringComparison.CurrentCultureIgnoreCase) >= 0;

Upvotes: 3

cdytoby

Reputation: 879

Just like this:

string s="AbcdEf";
if(s.ToLower().Contains("def"))
{
    Console.WriteLine("yes");
}

Upvotes: 14

TarmoPikaro

Reputation: 5243

This is quite similar to the other examples here, but I've decided to simplify the enum to a bool, primarily because the other alternatives are normally not needed. Here is my example:

public static class StringExtensions
{
    public static bool Contains(this string source, string toCheck, bool bCaseInsensitive )
    {
        return source.IndexOf(toCheck, bCaseInsensitive ? StringComparison.OrdinalIgnoreCase : StringComparison.Ordinal) >= 0;
    }
}

And usage is something like:

if( "main String substring".Contains("SUBSTRING", true) )
....

Upvotes: 7

FeiBao 飞豹

Reputation: 781

One issue with the answer is that it will throw an exception if a string is null. You can add that as a check so it won't:

public static bool Contains(this string source, string toCheck, StringComparison comp)
{
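    // A null or empty search string counts as contained, as with string.Contains("").
    if (string.IsNullOrEmpty(toCheck))
        return true;

    // A null or empty source cannot contain a non-empty string.
    if (string.IsNullOrEmpty(source))
        return false;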

    return source.IndexOf(toCheck, comp) >= 0;
} 

Upvotes: 55

Casey

Reputation: 3353

The InStr method from the VisualBasic assembly is the best if you have a concern about internationalization (or you could reimplement it). Looking at it in dotPeek shows that it accounts not only for caps and lowercase, but also for kana type and full- vs. half-width characters (mostly relevant for Asian languages, although there are full-width versions of the Roman alphabet too). I'm skipping over some details, but check out the private method InternalInStrText:

private static int InternalInStrText(int lStartPos, string sSrc, string sFind)
{
  int num = sSrc == null ? 0 : sSrc.Length;
  if (lStartPos > num || num == 0)
    return -1;
  if (sFind == null || sFind.Length == 0)
    return lStartPos;
  else
    return Utils.GetCultureInfo().CompareInfo.IndexOf(sSrc, sFind, lStartPos, CompareOptions.IgnoreCase | CompareOptions.IgnoreKanaType | CompareOptions.IgnoreWidth);
}

Upvotes: 11

mr.martan

Reputation: 213

Use this:

string.Compare("string", "STRING", new System.Globalization.CultureInfo("en-US"), System.Globalization.CompareOptions.IgnoreCase);

Upvotes: 9

serhio

Reputation: 28586

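I know that this is not C#, but the framework (VB.NET) already has such a function. Pass CompareMethod.Text to make the search case-insensitive; InStr returns a 1-based position, or 0 if the string is not found:

Dim str As String = "UPPERlower"
Dim b As Boolean = InStr(str, "UpperLower", CompareMethod.Text) > 0

C# variant:

string myString = "Hello World";
bool contains = Microsoft.VisualBasic.Strings.InStr(myString, "world", Microsoft.VisualBasic.CompareMethod.Text) > 0;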

Upvotes: 13

Ed Swangren

Reputation: 124760

You could always just up or downcase the strings first.

string title = "string";
title.ToUpper().Contains("STRING")  // returns true

Oops, just saw that last bit. A case-insensitive compare would *probably* do the same anyway, and if performance is not an issue, I don't see a problem with creating uppercase copies and comparing those. I could have sworn that I once saw a case-insensitive compare...

Upvotes: 105
