patel.milanb
patel.milanb

Reputation: 5992

REGEX help needed in c#

enter image description hereI am very new to reg-ex and i am not sure whats going on with this one.... however my friend gave me this to solve my issue BUT somehow it is not working....

string: department_name:womens AND item_type_keyword:base-layer-underwear

reg-ex: (department_name:([\\w-]+))?(item_type_keyword:([\\w-]+))?

desired output: array OR group

C# Code

Regex regex = new Regex(@"(department_name:([\w-]+))?(item_type_keyword:([\w-]+))?");
Match match = regex.Match(query);
if (match.Success)
    if (!String.IsNullOrEmpty(match.Groups[4].ToString()))
        d1.ItemType = match.Groups[4].ToString();

this C# code only returns string array with 3 element

1: department_name:womens
2: department_name:womens
3: womens

somehow it is duplicating 1st and 2nd element, i dont know why. BUT its not return the other elements that i expect..

can someone help me please...

when i am testing the regex online, it looks fine to me...

http://fiddle.re/crvw1

Thanks

Upvotes: 2

Views: 113

Answers (5)

Menaka Sankar
Menaka Sankar

Reputation: 189

If you use any spliting with And Or , etc that you can use

(department_name:(.*?)) AND (item_type_keyword:(.*?)$)

•1: department_name:womens •2: womens •3: item_type_keyword:base-layer-underwear •4: base-layer-underwear

Upvotes: 1

vks
vks

Reputation: 67988

(?=(department_name:\w+)).*?:([\w-]+)|(?=(item_type_keyword:.*)$).*?:([\w-]+)

Try this.This uses a lookahead to capture then backtrack and again capture.See demo.

http://regex101.com/r/lS5tT3/52

Upvotes: 0

Jonny 5
Jonny 5

Reputation: 12389

Another idea using a lookahead for capturing and getting all groups in one match:

^(?!$)(?=.*(department_name:([\w-]+))|)(?=.*(item_type_keyword:([\w-]+))|)

as a .NET String

"^(?!$)(?=.*(department_name:([\\w-]+))|)(?=.*(item_type_keyword:([\\w-]+))|)"

test at regexplanet (click on .NET); test at regex101.com

(add m multiline modifier if multiline input: "^(?m)...)

Upvotes: 2

Jerry
Jerry

Reputation: 71598

You can use something like this to get the output you have in your question:

string txt = "department_name:womens AND item_type_keyword:base-layer-underwear";
var reg = new Regex(@"(?:department_name|item_type_keyword):([\w-]+)", RegexOptions.IgnoreCase);
var ms = reg.Matches(txt);
ArrayList results = new ArrayList();
foreach (Match match in ms)
{
    results.Add(match.Groups[0].Value);
    results.Add(match.Groups[1].Value);
}

// results is your final array containing all results
foreach (string elem in results)
{
    Console.WriteLine(elem);
}

Prints:

department_name:womens
womens
item_type_keyword:base-layer-underwear
base-layer-underwear

match.Groups[0].Value gives the part that matched the pattern, while match.Groups[1].Value will give the part captured in the pattern.

In your first expression, you have 2 capture groups; hence why you have twice department_name:womens appearing.

Once you get the different elements, you should be able to put them in an array/list for further processing. (Added this part in edit)

The loop then allows you to iterate over each of the matches, which you cannot exactly do with if and .Match() (which is better suited for a single match, while here I'm enabling multiple matches so the order they are matched doesn't matter, or the number of matches).

ideone demo


(?:
  department_name     # Match department_name
|                     # Or
  item_type_keyword   # Match item_type_keyword
)
:
([\w-]+)              # Capture \w and - characters

Upvotes: 3

Avinash Raj
Avinash Raj

Reputation: 174836

It's better to use the alternation (or logical OR) operator | because we don't know the order of the input string.

(department_name:([\w-]+))|(item_type_keyword:([\w-]+))

DEMO

String input = @"department_name:womens AND item_type_keyword:base-layer-underwear";
Regex rgx = new Regex(@"(?:(department_name:([\w-]+))|(item_type_keyword:([\w-]+)))");
foreach (Match m in rgx.Matches(input))
{
Console.WriteLine(m.Groups[1].Value);
Console.WriteLine(m.Groups[2].Value);
Console.WriteLine(m.Groups[3].Value);
Console.WriteLine(m.Groups[4].Value);
}

IDEONE

Upvotes: 2

Related Questions