6:[["$","$Le",null,{}],["$","div",null,{"className":"min-h-screen bg-gray-100 p-6","children":[["$","$Lf",null,{}],["$","script",null,{"type":"application/ld+json","dangerouslySetInnerHTML":{"__html":"{\"@context\":\"https://schema.org\",\"@type\":\"QAPage\",\"mainEntity\":{\"@type\":\"Question\",\"name\":\"How to extract a page title\",\"text\":\"

I am trying to extract the page title from an HTML page with this pipeline:

\\n\\n

cat index.html | grep -i \\\"title>\\\"| sed 's/<title>/ /i'| sed 's/<\\\\/title>/ /i'\\n

\\n\\n

This breaks when the whole page is written on a single line (believe me, it happens): grep then returns the entire page instead of just the title line.

\\n\\n

How do I solve that?

\\n\\n

Thanks!

\\n\",\"author\":{\"@type\":\"Person\",\"name\":\"Zenet\"},\"upvoteCount\":1,\"answerCount\":2,\"acceptedAnswer\":{\"@type\":\"Answer\",\"text\":\"

sed -n 's/.*<title>\\\\(.*\\\\)<\\\\/title>.*/\\\\1/ip;T;q'\\n

\\n\\n

From Linux Commands.

\\n\\n

This is the first Google result for: unix extract page title.

\\n\",\"author\":{\"@type\":\"Person\",\"name\":\"mcandre\"},\"upvoteCount\":1}}}"}}],["$","div",null,{"className":"bg-white shadow-md rounded-lg p-6 mb-6 relative","children":[["$","div",null,{"className":"absolute top-4 right-4 flex flex-wrap space-x-2","children":[["$","span","html",{"className":"bg-blue-600 text-white text-sm px-3 py-1 rounded-full","children":["$","$L10",null,{"href":"/discussion/tag/html/1","children":"html"}]}],["$","span","shell",{"className":"bg-blue-600 text-white text-sm px-3 py-1 rounded-full","children":["$","$L10",null,{"href":"/discussion/tag/shell/1","children":"shell"}]}]]}],["$","div",null,{"className":"flex items-center mb-4","children":[["$","img",null,{"src":"https://www.gravatar.com/avatar/95e59ecae00ae5d25886c7644baa940d?s=256&d=identicon&r=PG","alt":"Zenet","className":"w-16 h-16 rounded-full border"}],["$","div",null,{"className":"ml-4","children":[["$","a",null,{"href":"https://stackoverflow.com/users/262854/zenet","target":"_blank","rel":"noopener noreferrer","className":"text-lg font-semibold text-blue-600 hover:underline","children":"Zenet"}],["$","p",null,{"className":"text-sm text-gray-500","children":["Reputation: ",7411]}]]}]]}],["$","h1",null,{"className":"text-2xl font-bold text-gray-800 mb-4","children":"How to extract a page title"}],["$","p",null,{"className":"text-gray-700 mt-4","dangerouslySetInnerHTML":{"__html":"

I am trying to extract the page title from an HTML page with this pipeline:

\n\n

cat index.html | grep -i \"title>\"| sed 's/<title>/ /i'| sed 's/<\\/title>/ /i'\n

\n\n

This breaks when the whole page is written on a single line (believe me, it happens): grep then returns the entire page instead of just the title line.

\n\n

How do I solve that?

\n\n

Thanks!

\n"}}],["$","div",null,{"className":"text-gray-600 text-sm mt-4","children":[["$","p",null,{"children":["Upvotes: ",1]}],["$","p",null,{"children":["Views: ",1269]}]]}]]}],["$","div",null,{"className":"container mx-auto","children":[["$","h2",null,{"className":"text-2xl font-semibold text-gray-800 mb-6","children":["Answers (",2,")"]}],[["$","div","3196426",{"className":"bg-white shadow-md rounded-lg p-6 mb-6","children":[["$","div",null,{"className":"flex items-center mb-4","children":[["$","img",null,{"src":"https://www.gravatar.com/avatar/c2618d986361c695497c1a875ea8da01?s=256&d=identicon&r=PG","alt":"ghostdog74","className":"w-12 h-12 rounded-full border"}],["$","div",null,{"className":"ml-4","children":[["$","a",null,{"href":"https://stackoverflow.com/users/131527/ghostdog74","target":"_blank","rel":"noopener noreferrer","className":"text-lg font-semibold text-blue-600 hover:underline","children":"ghostdog74"}],["$","p",null,{"className":"text-sm text-gray-500","children":["Reputation: ",342373]}]]}]]}],["$","p",null,{"className":"text-gray-700 mb-4","dangerouslySetInnerHTML":{"__html":"

This awk one-liner also works for a title that spans more than one line.

\n\n

$ cat file\n<html>\n    <title>How to extract a page\ntitle - Stack Overflow</title>\n    <link rel=\"stylesheet\" href=\"http://sstatic.net/so/all.css?v=4864b39b46cf\">\n    <link rel=\"shortcut icon\" href=\"http://sstatic.net/so/favicon.ico\">\n    <link rel=\"apple-touch-icon\" href=\"http://sstatic.net/so/apple-touch-icon.png\">\n</html>\n\n$ awk 'BEGIN{RS=\"</title>\"}/<title>/{gsub(\".*<title>\",\"\");print;exit}' file\nHow to extract a page\ntitle - Stack Overflow\n

\n"}}],["$","div",null,{"className":"text-gray-600 text-sm","children":["$","p",null,{"children":["Upvotes: ",0]}]}]]}],["$","div","3195895",{"className":"bg-white shadow-md rounded-lg p-6 mb-6","children":[["$","div",null,{"className":"flex items-center mb-4","children":[["$","img",null,{"src":"https://www.gravatar.com/avatar/dfe88469b75efc87cbcbbbc2a975850a?s=256&d=identicon&r=PG","alt":"mcandre","className":"w-12 h-12 rounded-full border"}],["$","div",null,{"className":"ml-4","children":[["$","a",null,{"href":"https://stackoverflow.com/users/350106/mcandre","target":"_blank","rel":"noopener noreferrer","className":"text-lg font-semibold text-blue-600 hover:underline","children":"mcandre"}],["$","p",null,{"className":"text-sm text-gray-500","children":["Reputation: ",24602]}]]}]]}],["$","p",null,{"className":"text-gray-700 mb-4","dangerouslySetInnerHTML":{"__html":"

sed -n 's/.*<title>\\(.*\\)<\\/title>.*/\\1/ip;T;q'\n

\n\n

From Linux Commands.

\n\n

This is the first Google result for: unix extract page title.

\n"}}],["$","div",null,{"className":"text-gray-600 text-sm","children":["$","p",null,{"children":["Upvotes: ",1]}]}]]}]]]}],["$","div",null,{"className":"bg-white shadow-md rounded-lg p-6 mt-6","children":[["$","h2",null,{"className":"text-2xl font-semibold text-gray-800 mb-4","children":"Related Questions"}],["$","ul",null,{"className":"list-disc list-inside","children":[["$","li","11711339",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/11711339","className":"text-blue-600 hover:underline","children":"Getting Webpage Title, Img, Metadata info from Linux Terminal"}]}],["$","li","74133242",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/74133242","className":"text-blue-600 hover:underline","children":"Use shell script to copy a title tag in HTML file"}]}],["$","li","3833088",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/3833088","className":"text-blue-600 hover:underline","children":"Extract Title of a html file using grep"}]}],["$","li","31331582",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/31331582","className":"text-blue-600 hover:underline","children":"Get the content of title tag from a webpage with PHP"}]}],["$","li","574199",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/574199","className":"text-blue-600 hover:underline","children":"How do I extract an HTML title with Perl?"}]}],["$","li","717100",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/717100","className":"text-blue-600 hover:underline","children":"extract title tag from html"}]}],["$","li","12355323",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/12355323","className":"text-blue-600 hover:underline","children":"Extract title from HTML content"}]}],["$","li","17790990",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/17790990","className":"text-blue-600 
hover:underline","children":"Extract title of HTML page without loading entire page"}]}],["$","li","3210755",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/3210755","className":"text-blue-600 hover:underline","children":"How to get web page title using html parser"}]}],["$","li","4914573",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/4914573","className":"text-blue-600 hover:underline","children":"How can I get the title of an HTML page using php?"}]}]]}]]}]]}],["$","$L11",null,{}],["$","$L12",null,{}],["$","$L13",null,{}],["$","$L14",null,{}],["$","$L15",null,{}]]

How to extract a page title

Answers (2)
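
The two answers above can be combined into one small helper that handles both a page written on a single line and a title that spans several lines. This is only a sketch: the function name `extract_title` is made up for illustration, and it assumes a `grep` that supports `-o` (GNU and BSD grep do, but POSIX does not require it). It joins all lines first, prints only the matched `<title>` text, and then strips the opening tag. Like any regex-based approach, it is not a real HTML parser, so a title containing a literal `<` would be cut short.

```shell
# Hypothetical helper: print the text of the first <title> element.
# Reads the files given as arguments, or standard input if there are none.
extract_title() {
    cat -- "$@" |
        tr '\n' ' ' |                        # join lines so a multi-line title becomes one line
        grep -io '<title[^>]*>[^<]*' |       # -o prints only the match, not the whole line
        head -n 1 |                          # keep the first match only
        sed 's/<[^>]*>//'                    # drop the opening <title> tag
}
```

It can be called as `extract_title index.html`, or at the end of a pipeline such as `curl -s "$url" | extract_title`, where `$url` stands for whatever page is being fetched.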

Related Questions