the_drow
the_drow

Reputation: 19181

How do I sanitize title URIs with PHP?

I am programming a blog and I want the URIs to be the title like the question title here in stackoverflow or like wordpress.
What are the rules for sanitizing a URI?
Is there an already made code in PHP that does this?

Thanks in advance,
Omer

Upvotes: 2

Views: 5647

Answers (4)

raspi
raspi

Reputation: 6122

Here's how drupal does it.

In case of site goes down:

<?php
function pathauto_cleanstring($string)
{
    $url = $string;
    $url = preg_replace('~[^\\pL0-9_]+~u', '-', $url); // substitutes anything but letters, numbers and '_' with separator
    $url = trim($url, "-");
    $url = iconv("utf-8", "us-ascii//TRANSLIT", $url); // TRANSLIT does the whole job
    $url = strtolower($url);
    $url = preg_replace('~[^-a-z0-9_]+~', '', $url); // keep only letters, numbers, '_' and separator
    return $url;
}

Upvotes: 6

Gumbo
Gumbo

Reputation: 655239

This might be the shortest way to replace any non alphanumeric character with a single hyphen:

trim(preg_replace('/[^a-z0-9-]+/', '-', strtolower($str)), '-')

Upvotes: 6

Will Morgan
Will Morgan

Reputation: 4490

Generally you'll want your URL to have only 0-9 and a-z, and make sure that everything is lowercase. Replace spaces with dashes (-), and strip the rest of the gibberish.

SO pretty much has it figured out.

Upvotes: 2

soulmerge
soulmerge

Reputation: 75704

Many CMS's have implemented something like that, the one of Wordpress has been posted in another question. You might be interested in the question about this technique in general, too.

Upvotes: 2

Related Questions