What is a promise in Javascript?

Question

Mariano

Asked: 2020-12-02 12:55:26 +0800 CST 2020-12-02 12:55:26 +0800 CST 2020-12-02 12:55:26 +0800 CST

Validate an email in JavaScript that accepts all Latin characters

772

Ask

How to validate an e-mail that accepts all Latin characters?

By Latin characters I mean accented letters, ñ, ç, and all those used by languages like Spanish, Portuguese, Italian... Latin.

Context

The goal is to display an icon next to the text as the user types their email address.
I am not interested in accepting all valid cases. It was a design decision to cover only the most frequent emails. That is, letters (including accents and the like) and the symbols ._%+-.
I can use code from other sources, as long as they are popular (eg jQuery).

Code

document.getElementById('email').addEventListener('input', function() {
    campo = event.target;
    valido = document.getElementById('emailOK');
        
    emailRegex = /^[-\w.%+]{1,64}@(?:[A-Z0-9-]{1,63}\.){1,125}[A-Z]{2,63}$/i;
    //Se muestra un texto a modo de ejemplo, luego va a ser un icono
    if (emailRegex.test(campo.value)) {
      valido.innerText = "válido";
    } else {
      valido.innerText = "incorrecto";
    }
});

<p>
    Email:
    <input id="email">
    <span id="emailOK"></span>
</p>

cases

I am using the regex

/^[-\w.%+]{1,64}@(?:[A-Z0-9-]{1,63}\.){1,125}[A-Z]{2,63}$/i

Which works perfect in cases like

[email protected]
[email protected]

But it fails with accents and other Latin letters

germá[email protected]
yo@mi-compañía.com
estaçã[email protected]

6 Answers

Voted

Hewbot · Answer 1 · 2020-12-02T13:13:37+08:00

Best Answer

Hewbot

2020-12-02T13:13:37+08:002020-12-02T13:13:37+08:00

With this regular expression you can validate any email address that contains Unicode characters:

/^(([^<>()[\]\.,;:\s@\"]+(\.[^<>()[\]\.,;:\s@\"]+)*)|(\".+\"))@(([^<>()[\]\.,;:\s@\"]+\.)+[^<>()[\]\.,;:\s@\"]{2,})$/i

If you test it in a JavaScript console:

> emailRegex.test("[email protected]");
< true
> emailRegex.test("germá[email protected]");
< true

Font

From there, and as you have very well mentioned, an expression that best suits your needs would be the following:

/^(?:[^<>()[\].,;:\s@"]+(\.[^<>()[\].,;:\s@"]+)*|"[^\n"]+")@(?:[^<>()[\].,;:\s@"]+\.)+[^<>()[\]\.,;:\s@"]{2,63}$/i

107

Jorgesys · Answer 2 · 2020-12-02T13:14:07+08:00

There are certain restrictions for emails but I can comment that they should regularly be based on these rules:

Uppercase and lowercase letters of the English alphabet.

Numbers from 0 to 9

can contain dot but not at start or repeat.

you can use the characters: !#$%&'*+-/=?^_`{|}~

There are restrictions with certain types of email for example if they contain:

Greek alphabet.

Cyrillic characters.

Japanese characters.

Latin alphabet with diacritics.

Examples not accepted as valid email addresses:

червь.ca®[email protected]

josé.patroñ[email protected]

See more :

https://en.wikipedia.org/wiki/Email_address https://www.rfc-editor.org/rfc/rfc5322

I imagine an email with Cyrillic characters, even worse if what you want is to store that data in a DB, what type of SQL collation to use!

But well, the question refers to how to validate this type of emails, this is a script that would help with the task:

function validarEmail(valor) {
  if (/^(([^<>()[\]\.,;:\s@\"]+(\.[^<>()[\]\.,;:\s@\"]+)*)|(\".+\"))@(([^<>()[\]\.,;:\s@\"]+\.)+[^<>()[\]\.,;:\s@\"]{2,})$/i.test(valor)){
   alert("La dirección de email " + valor + " es correcta!.");
  } else {
   alert("La dirección de email es incorrecta!.");
  }
}

for instance:

validarEmail("jorgé[email protected]");

The script would show you that the email address is correct.

Update:

It is now possible to use international characters in domain names and email addresses .

Traditional email addresses are limited to characters from the English alphabet and a few other special characters. The following are valid traditional email addresses:

  [email protected]                                (English, ASCII)
  [email protected]                            (English, ASCII)
  user+mailbox/[email protected]   (English, ASCII)
  !#$%&'*+-/=?^_`.{|}[email protected]               (English, ASCII)
  "Abc@def"@example.com                          (English, ASCII)
  "Fred Bloggs"@example.com                      (English, ASCII)
  "Joe.\\Blow"@example.com                       (English, ASCII)

International email, by contrast, uses Unicode characters encoded as UTF-8 , which allows the text of addresses to be encoded in most of the world's writing systems.

The following are all valid international email addresses:

  用户@例子.广告                   (Chinese, Unicode)
  अजय@डाटा.भारत                    (Hindi, Unicode)
  квіточка@пошта.укр             (Ukrainian, Unicode)
  θσερ@εχαμπλε.ψομ               (Greek, Unicode)
  Dörte@Sörensen.example.com     (German, Unicode)
  аджай@экзампл.рус              (Russian, Unicode)

SnareChops · Answer 3 · 2020-12-02T13:19:21+08:00

SnareChops

2020-12-02T13:19:21+08:002020-12-02T13:19:21+08:00

I've found an article here that talks about a few different regular expression statements that can verify email addresses based on the RFC standard. There are many different recommended regular expression statements and there is no single all-in-one solution. But this regex is probably the one I'd go with, adding accented characters to the list of valid characters as well.

\A[a-z0-9!#$%&'*+/=?^_`{|}~-]+(?:\.[a-z0-9!#$%&'*+/=?^_`{|}~-]+)*@(?:[a-z0-9](?:[a-z0-9-]*[a-z0-9])?\.)+[a-z0-9](?:[a-z0-9-]*[a-z0-9])?\z

19

Braiam · Answer 4 · 2020-12-03T14:34:01+08:00

Braiam

2020-12-03T14:34:01+08:002020-12-03T14:34:01+08:00

How to validate an email that accepts all Latin characters?

The only 100% secure way to verify if an email is valid is by sending one. If the user typed the email wrong, they will simply retry.

According to RFC 5322 , [email protected]it is a "valid" email, but is anyone going to receive it? Is there a server behind the domain that accepts emails? Those are the concerns you should have. Whatever you are doing, a mailing list, registration, etc. You must send a confirmation email to validate it . The implementation will depend on the stack you use (C#, PHP, Java?) and you will have valid emails that someone receives.

You can implement something on the client side that at least says "this is an email address", but it shouldn't be your "validation" tool, it's just trying to make the user realize that what they typed is # ($^ %#$@^( #$^.com. If the client uses a modern browser, you can use <input type="email">in your form, this will eliminate the need to maintain the regex.

16

A. Cedano · Answer 5 · 2020-03-18T17:50:26+08:00

Simply to point out that, according to the official specification , the REGEX that represents an orthographically valid email address is the following:

/^[a-zA-Z0-9.!#$%&'*+/=?^_`{|}~-]+@[a-zA-Z0-9](?:[a-zA-Z0-9-]{0,61}[a-zA-Z0-9])?(?:\.[a-zA-Z0-9](?:[a-zA-Z0-9-]{0,61}[a-zA-Z0-9])?)*$/

I put the term spelling valid email address on purpose , because what defines a really valid email address is that it works, that is, that it exists and can receive emails.

It follows that a verification via Javascript is not enough. It can help us do a spell check , provided Javascript is enabled on the client side.

If you want to verify that the email really exists , there is no other way than to send an email and have the recipient reply. This is what can be called with all property real validation of an email .

In fact, that is what all serious subscription services do, they send us an email that we must verify in order to be definitively registered on their sites or in their distribution lists.

Allow me to graphically show the steps to validate an e-mail. We will see that what is discussed here is just stage 2 of a validation process that would comprise 5 stages :

Stage 1 : The user writes an e-mail
Stage 2 : Spell validationof the e-mail written by the user
Stage 3 : Check if the domain corresponding to the orthographically validated e-mail has an e-mail server
Stage 4 : Send a request (ping) or an email to verify that the server is accepting emails
Stage 5 : The e-mail was received correctly at that address and the user confirms in some way that they have received it (by clicking on a link, sending a reply email, etc.)

Until we reach stage 5, we cannot say that the email has been validated .

If the OP still asks for a validation method that accepts addresses with ñ and other characters not defined so far by the official w3.org spec (link above), the REGEX mentioned in a previous answer works.

The code that follows is the same used in the question, but implementing on the one hand the official REGEX and the REGEX that allows Latin characters such as ñ.

document.getElementById('email').addEventListener('input', function() {
    campo = event.target;
    valido = document.getElementById('emailOK');
        
  var reg = /^(([^<>()[\]\\.,;:\s@\"]+(\.[^<>()[\]\\.,;:\s@\"]+)*)|(\".+\"))@((\[[0-9]{1,3}\.[0-9]{1,3}\.[0-9]{1,3}\.[0-9]{1,3}\])|(([a-zA-Z\-0-9]+\.)+[a-zA-Z]{2,}))$/;

 var regOficial = /^[a-zA-Z0-9.!#$%&'*+/=?^_`{|}~-]+@[a-zA-Z0-9](?:[a-zA-Z0-9-]{0,61}[a-zA-Z0-9])?(?:\.[a-zA-Z0-9](?:[a-zA-Z0-9-]{0,61}[a-zA-Z0-9])?)*$/;

    //Se muestra un texto a modo de ejemplo, luego va a ser un icono
    if (reg.test(campo.value) && regOficial.test(campo.value)) {
      valido.innerText = "válido oficial y extraoficialmente";
    } else if (reg.test(campo.value)) {
      valido.innerText = "válido extraoficialmente";

    } else {
      valido.innerText = "incorrecto";

}
});

<p>
    Email:
    <input id="email">
    <span id="emailOK"></span>
</p>

Spell check in HTML5

HTML5 allows us to declare our inputemail type and handles (partly) the validation for us, as MDN says :

email: The attribute represents an email address. Line breaks are automatically removed from the entered value. An invalid email address can be entered, but the input field will only work if the address satisfies the output ABNF 1*( atext / "." ) "@" ldh-str 1*( "." ldh-str )where it atextis defined in RFC 5322, section 3.2.3 and ldh-stris defined in RFC 1034, section 3.5.

It can be combined emailwith the attribute pattern:

pattern: A regular expression against which the value is evaluated. The pattern must match the entire value, not just part of it. The title attribute can be used to describe the pattern to help the user. This attribute applies when the attribute typeis text, search, tel, url, email, or password, and is ignored otherwise. The regular expression language is the same as the JavaScript RegExp algorithm, with the parameter 'u'allowing the pattern to be treated as a sequence of Unicode code. The pattern is not surrounded by diagonals.

The downside is that not all clients support HTML5.

<form>
<input type="email" pattern='^(([^<>()[\]\\.,;:\s@\"]+(\.[^<>()[\]\\.,;:\s@\"]+)*)|(\".+\"))@((\[[0-9]{1,3}\.[0-9]{1,3}\.[0-9]{1,3}\.[0-9]{1,3}\])|(([a-zA-Z\-0-9]+\.)+[a-zA-Z]{2,}))$' title="Entre un email válido"  placeholder="Entre su email">
<input type="submit" value="Submit">
</form>

dayer · Answer 6 · 2020-12-02T16:46:48+08:00

According to RFC 6531, more characters than we are used to should be supported. But the servers limit it with previous ones. I don't see a solution with a single range that involves entering "all latin characters". Although they seem to go together (as in this table from 0080 to 00FF ), there are others in between.

A possible regex for the latin characters you might be interested in ( source ) and adding the ( suggestion ):

/[A-Za-z\u0021-\u007F\u00C0-\u00D6\u00D8-\u00f6\u00f8-\u00ff]+/g

It could be combined with your regex, the ones already indicated above or one according to RFC 2822, like this, so that it does not exclude the ranges that interest you (there are many types of accents) ( source ):

^([^\x00-\x20\x22\x28\x29\x2c\x2e\x3a-\x3c\x3e\x40\x5b-\x5d\x7f-\xff]+|\x22([^\x0d\x22\x5c\x80-\xff]|\x5c[\x00-\x7f])*\x22)(\x2e([^\x00-\x20\x22\x28\x29\x2c\x2e\x3a-\x3c\x3e\x40\x5b-\x5d\x7f-\xff]+|\x22([^\x0d\x22\x5c\x80-\xff]|\x5c[\x00-\x7f])*\x22))*\x40([^\x00-\x20\x22\x28\x29\x2c\x2e\x3a-\x3c\x3e\x40\x5b-\x5d\x7f-\xff]+|\x5b([^\x0d\x5b-\x5d\x80-\xff]|\x5c[\x00-\x7f])*\x5d)(\x2e([^\x00-\x20\x22\x28\x29\x2c\x2e\x3a-\x3c\x3e\x40\x5b-\x5d\x7f-\xff]+|\x5b([^\x0d\x5b-\x5d\x80-\xff]|\x5c[\x00-\x7f])*\x5d))*$

Validate an email in JavaScript that accepts all Latin characters

Ask

Context

Code

cases

Spell check in HTML5

HTML button that sends you to another page

Why do I get the error "Call to undefined function mysql_connect()"?

How to create an HTML button that works as a link?

How to separate a String in Java. How to use split()

Filter by dates in sql server

How to limit the number of decimal places in a double?

For each in JavaScript?

Position footer ALWAYS glued to the footer

Definitive Guide to Type Conversion in Java

How to properly compare Strings (and objects) in Java?