C++/CLI Code Snippet - Extract URLs

C++/CLI Code Snippet - Extract URLs

C++/CLI Code Snippet - Extract URLs

C++/CLI code snippet to extracts all the URLs from a string. ExtractURLs returns string array of URLs successful matches by iteratively applying a regular expression pattern to the input string.

Bookmark:

C++/CLI Code Snippet - Extract URLs

This .Net C++/CLI code snippet extracts all the URLs from a string. Data mining for URLs done by set of successful matches found by iteratively applying a regular expression pattern to the input string.

array<System::String^> ^ExtractURLs(System::String ^str)
{
    // match.Groups["name"].Value - URL Name
    // match.Groups["url"].Value - URI
    System::String ^RegexPattern = "<a.*?href=[\"'](?<url>.*?)[\"'].*?>(?<name>.*?)</a>"

    // Find matches.
    System::Text::RegularExpressions::MatchCollection ^matches = System::Text::RegularExpressions::Regex::Matches(str, RegexPattern, System::Text::RegularExpressions::RegexOptions::IgnoreCase);

    array<System::String^> ^MatchList = gcnew array<System::String^>(matches->Count);

    // Report on each match.
    int c = 0;
    for each (System::Text::RegularExpressions::Match ^match in matches)
    {
        MatchList[c] = match->Groups["url"]->Value;
        c++;
    }

    return MatchList;
}


C++/CLI Keywords Used:

  • Regex
  • Match
  • MatchCollection
  • RegexOptions::IgnoreCase
  • Regex::Matches

Code Snippet Information:

  • Applies To: .Net, C++/CLI, Data Mining, URL Extract, Regular Expression
  • Programming Language : C++/CLI

External Resources:

Leave a comment