Questions tagged [html] - page 1

fedorqui

Asked: 2020-07-26 01:11:06 +0800 CST

Can you parse HTML with regular expressions?

26

Yesterday I translated the answer to "RegEx match open tags except XHTML self-contained tags", with its famous passage:

You can't parse [X]HTML with regular expressions, because HTML can't be parsed with regex. Regex is not a tool that can be used to properly parse HTML. As I have answered in HTML-and-regex questions so many times before, using regex will not allow you to consume HTML. Regular expressions are a tool that is not sophisticated enough to understand the constructs used by HTML. HTML is not a regular language and therefore cannot be parsed using regular expressions. Regular expressions are not equipped to break HTML down into its meaningful parts.

which ends with a final demo of broken HTML:

appears ~~, the~~ stinking regex infection will devour your HT ML parser, your application and your existence forever as mere Visual Basic or worse he comes don't fight he comes v̡im̡ie̶ne, ̕h̵u radiation destroying all brightness, tags of HTML filtering from your eyes, like a fragrant liquid, the song of parsing regular expressions is going to extinguish the voices of mortal man from the sphere I can see it you can see it 's beautiful or the ending extinguishing Men's lies EVERYTHING IS LOST EVERYTHING IS LOSTDO e l pon̷and he comes ~~he comes he comes~~ ~~the~~ íco r permeates everything M I FACE M I FACE ᵒh dos n o o NO NOO̼ OON Θ para los an*̶͑̾̾ ̅ͫ͏̙̤g͇̫͛͆̾ͫ̑͆ul͖͉̗̩̳̟o ̍ͫͥͨ ͨ Or they are rè̑ͧ̌aͨl̘̝̙̃ͤ͂̾̆es za̡͊͠͝lg red e sͮ̂҉̯͈͕̹̘̱ alȳ̳ ë͖̉l ͠p̯͍̭o̚ n̐y̡ ȩ̬̩̾͛ͪ̈̀͘l ̶̧̨̱̹̭̯ͧ̾ͬvien ȇ̴̟̟͙̞ͩ͌͝ "

I agree with its assertions:

HTML cannot be parsed with regex.
Regular expressions are not sophisticated enough for this task.
HTML is not a regular language and therefore cannot be parsed with regular expressions.

But then I received a comment from Mariano:

I know this is a joke that became famous. However, "HTML cannot be parsed with regex" is false. "not sophisticated enough" is false. "they are not equipped to dissect HTML" is false. "is not a regular language and therefore cannot be parsed using regular expressions" is flat out false. What is true is that it will give you headaches, because it is not a tool that fits the job... I hate this post.

And I was left wondering. Further searching brought me to Jeff Atwood's 2009 blog post, Parsing Html The Cthulhu Way, where he starts off by discussing the answer I just quoted and the sentiment that produced it. However, he then analyzes the state of the matter and shows that it is not so clear that it cannot be done. He mentions a discussion in which experienced programmers defend using regex in certain cases.

Therefore, the question is:

Can you parse HTML with regular expressions?
In which cases is it recommended to do it?
In which cases is it inadvisable?
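As a small illustration of why the answer is not clear-cut, here is a minimal Python sketch, not taken from any of the posts quoted above. It assumes a hypothetical sample string and a deliberately naive pattern. The regex handles a flat tag fine but stops at the first closing tag when divs are nested, while the standard library's `html.parser` can track nesting with a counter.

```python
import re
from html.parser import HTMLParser

# Hypothetical sample input: one div nested inside another.
SAMPLE = "<div>outer <div>inner</div> tail</div>"

# Deliberately naive pattern: from an opening <div> to the nearest </div>.
NAIVE_DIV = re.compile(r"<div>(.*?)</div>", re.DOTALL)


def naive_div_content(html):
    """Return what the naive regex takes to be the content of the first div."""
    match = NAIVE_DIV.search(html)
    return match.group(1) if match else None


class DepthTracker(HTMLParser):
    """Counts how deeply div elements are nested while parsing."""

    def __init__(self):
        super().__init__()
        self.depth = 0
        self.max_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag == "div":
            self.depth += 1
            self.max_depth = max(self.max_depth, self.depth)

    def handle_endtag(self, tag):
        if tag == "div":
            self.depth -= 1


def max_div_depth(html):
    """Return the maximum div nesting depth seen by the parser."""
    tracker = DepthTracker()
    tracker.feed(html)
    tracker.close()
    return tracker.max_depth


if __name__ == "__main__":
    # The regex cuts the outer div short at the inner closing tag.
    print(naive_div_content(SAMPLE))
    # The parser sees both levels of nesting.
    print(max_div_depth(SAMPLE))
```

For a flat fragment such as `<div>hello</div>` the same pattern returns the expected content, which is consistent with the view that regex can be acceptable for small, known, non-nested inputs.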

You may have noticed that I use "parsear" and "analizar" interchangeably. I do so because one seems to be the translation of the other, although it is no less true that in Spanish-speaking environments the use of "parsear" is very widespread.

Dev 200

Asked: 2020-04-01 04:53:41 +0800 CST

Why does the question mark character (�) appear in some data obtained from the database?

74

I was dealing with the problem of converting accents and special characters in my system.

It now happens that some of the data obtained from the database that contains accents comes out with this: �.

The strange thing is that up to 20 values with accents may be displayed, but only some come out wrong, for example SANCI�N. What could be happening?

The only fix I have found is to add this: <meta http-equiv="content-type" content="text/html; charset=UTF-8">

But despite it being on the forms, in some cases the � still appears.

Dynamically generated data produces that error.

CONNECTION DATA:

config.ini.php

;<?php
;die(); // /* Do not modify unless you know what you are doing */
;/*
[database]
driver="mysql"
host="localhost"
port="3306"
schema="bbdd"
username="root"
password="pass"
encode="utf8"
;*/

Connection.php:

<?php
// Load the connection settings from the INI file.
$file = 'config.ini.php';
$config = parse_ini_file($file, true);
$host = $config['database']['host'];
$user = $config['database']['username'];
$pass = $config['database']['password'];
$schema = $config['database']['schema'];
$encode = $config['database']['encode'];

class conexion extends mysqli
{
    public function __construct($host, $user, $pass, $schema)
    {
        parent::__construct($host, $user, $pass, $schema);
        // Stop if the connection could not be established.
        if (mysqli_connect_error()) {
            die();
        }
    }
}

$conexion = new conexion($host, $user, $pass, $schema);
// Apply the character set from the configuration to the connection.
mysqli_set_charset($conexion, $encode);
?>
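One common cause of this symptom is text stored or sent in a single-byte encoding such as Latin-1 but decoded as UTF-8. That is an assumption here, since the question does not confirm which encodings are involved. The Python sketch below only reproduces the effect on a hypothetical value, the word from the question. The helper name `decode_as_utf8` is made up for the example.

```python
# The word from the question, written with an escape for the accented O.
WORD = "SANCI\u00d3N"


def decode_as_utf8(raw):
    """Decode bytes as UTF-8, replacing invalid sequences with U+FFFD."""
    return raw.decode("utf-8", errors="replace")


# The same text as bytes in two different encodings.
latin1_bytes = WORD.encode("latin-1")
utf8_bytes = WORD.encode("utf-8")

if __name__ == "__main__":
    # Latin-1 bytes are not valid UTF-8, so the accented letter is replaced.
    print(decode_as_utf8(latin1_bytes))
    # Bytes that really are UTF-8 decode back to the original word.
    print(decode_as_utf8(utf8_bytes))
```

If only some rows show the problem, one possibility under the same assumption is that those rows were written with a different encoding than the rest.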

Alvaro Montoro

Asked: 2020-09-24 07:12:03 +0800 CST

What are ARIA attributes?

26

Attributes beginning with aria- and related to accessibility can be found on many web pages and in quite a few StackOverflow questions. For example:

<div class="text">
    <label id="tp1-label" for="nombre">Nombre:</label>
    <input type="text" id="nombre" name="nombre" size="20"
           aria-labelledby="tp1-label"
           aria-describedby="tp1"
           aria-required="true" />
    <div id="tp1" class="tooltip"
         role="tooltip"
         aria-hidden="true">El nombre es obligatorio</div>
</div>

What are aria-* attributes and what are they for? How many are there and which are the most important? And why should they be used?

lugomezb

Asked: 2020-09-23 07:25:47 +0800 CST

I forgot to put the DOCTYPE in an HTML document, so more user agent styles were added to a table... Why?

23

These styles were added to a table inside an HTML document without a DOCTYPE.

Mosty Mostacho

Asked: 2020-12-02 11:06:56 +0800 CST

How can I horizontally center a div inside another div?

40

I need to horizontally center the inner div (interno) inside the outer div (externo), based on this HTML and CSS code:

<div class="externo">
    <div class="interno">
    </div>
</div>

.interno {
    background-color: green;
    height: 20px;
    width: 50%;
}

.externo {
    background-color: red;
    width: 200px;
    padding: 20px;
}

JSFiddle: https://jsfiddle.net/fpvkcrkg/
