What is a promise in Javascript?

Question

Asked: 2020-05-31 10:15:26 +0800 CST 2020-05-31 10:15:26 +0800 CST 2020-05-31 10:15:26 +0800 CST

Extract all texts between two delimiters within an html code

772

As the text says, I have a long html code, approximately 1000 lines, where there are lots of and , here is an example

<td class="textleft"><a href="/DIRECCION-URL.html">TEXTO DE LA URL</a></td><td>DESCRIPCION</td><td class="mobile-hidden">POBLACION</td></tr>

That pattern is repeated about 100 times in the entire html code, and from that code I would need to be able to literally extract this --> /URL-ADDRESS.html on the one hand, and URL TEXT on the other hand, into an associative array of course.

I have done several tests with preg_match_all in php but the only one that has worked for me returns values with http that are just the ones I want to omit.

I must admit that this code that I put is copied and that I have simply messed with it a bit to try to adapt it to what I need, but I cannot get it out

preg_match_all('#/[^,\s()<>]+(?:([\w\d]+)|([^,[:punct:]\s]|/))#', $htmlcontent , $results);

1 Answers

Voted

Jorge Arturo Juarez · Answer 1 · 2020-06-11T15:40:18+08:00

With xpath you can achieve it

with //td[@class='textleft']/a/@hrefyou get all the values of the href attribute ( @href), within all the links ( a), within all the cells [ td] that if the class attribute is 'textleft'[@class='textleft']

with //td[@class='textleft']/a/text()you get the text (/text())inside all the links ( /a), inside all the cells ( td) that its class attribute is 'textleft'[@class='textleft']

<?php
$string = '<tr><td class="textleft"><a href="/DIRECCION-URL.html">TEXTO DE LA URL</a></td><td>DESCRIPCION</td><td class="mobile-hidden">POBLACION</td></tr>
<tr><td class="textleft"><a href="/DIRECCION-URL.html">TEXTO DE LA URL</a></td><td>DESCRIPCION</td><td class="mobile-hidden">POBLACION</td></tr>
<tr><td class="textleft"><a href="/DIRECCION-URL.html">TEXTO DE LA URL</a></td><td>DESCRIPCION</td><td class="mobile-hidden">POBLACION</td></tr>';

$dom = new DomDocument;
$dom->loadHTML($string);
$xpath = new DomXPath($dom);
$urls = $xpath->query("//td[@class='textleft']/a/@href");
$textos=$xpath->query("//td[@class='textleft']/a/text()");
foreach ($urls as $i => $node) {
    echo "url: ", $node->nodeValue, "\n";
}
foreach ($textos as $i => $node) {
    echo "texto: ", $node->nodeValue, "\n";
}

?>

Extract all texts between two delimiters within an html code

HTML button that sends you to another page

Why do I get the error "Call to undefined function mysql_connect()"?

How to create an HTML button that works as a link?

How to separate a String in Java. How to use split()

Filter by dates in sql server

How to limit the number of decimal places in a double?

For each in JavaScript?

Position footer ALWAYS glued to the footer

Definitive Guide to Type Conversion in Java

How to properly compare Strings (and objects) in Java?