Jenna
Jenna

Reputation: 53

extracting substrings in C

I have the string: "foo$bar@baz"

I'm looking to write a C program which will extra all three sub-strings ("foo", "bar" and "baz") and put each into it's own string.

P.S. Don't worry, this is not homework.

Upvotes: 5

Views: 765

Answers (4)

john personna
john personna

Reputation: 442

Since this is straight C, it might be fun to revisit how strings are stored and terminated. Since you have one terminating character for each section, you can just make it into a true terminator ('\0') and leave the strings in place:

#include <stdio.h>
#include <stdlib.h>
#include <string.h>

int main(int argc, char **argv) {

char *s1,*s2,*s3, *test = "foo$bar@baz";
char *buf=(char *)malloc(100);
char *p,c;

strcpy(buf, test);

s1 = p = buf;
while(c = *p) {
  if (c == '$') { *p = '\0'; s2 = p+1; }
  if (c == '@') { *p = '\0'; s3 = p+1; }
  p++;
}

printf("s1 = %s\n",s1);
printf("s2 = %s\n",s2);
printf("s3 = %s\n",s3);

}

I wouldn't do this in production code, in this day and age. But way back when, doing one pass on the loop, and one copy for storage, would have been considered a big win.

Upvotes: 1

user224003
user224003

Reputation: 217

Strtok keeps a static buffer when tokenizing that gets overwritten when if called elsewhere with a new non-NULL string, which could break the functionality you're looking for. e.g. interwoven calls with different strings.

Instead, you can use strsep, which is like strtok, but you keep the temporary buffer yourself, in case you need to tokenize multiple strings, e.g. with the interwoven calls with different strings.

In small cases, this problem probably won't arise, but it can in larger projects.

EDIT: this isn't std c, so make sure you have this function before trying to use it. I know for certain that it is available on BSD, and possibly other unix's

EDIT: strsep and strtok_r appear to have the same functionality

Upvotes: 0

Yuval
Yuval

Reputation: 286

if it is not for homework :-) than strtok is not recommended, if you can't use C++ (why?) you should use strtok_r (reentrent version)

Upvotes: 1

monksy
monksy

Reputation: 14234

What you are looking for is strtok. It allows for you to set the delimiters as well.

Upvotes: 10

Related Questions