[Fwd: [Html-widget] formfu - new features]

Jason Kohles email at jasonkohles.com
Wed Mar 14 02:08:09 GMT 2007


On Mar 13, 2007, at 11:35 AM, Mario Minati wrote:

>
> With Regex::Common I found some address that still validate but  
> which are not
> valid, at least I've never seen addresses like them:
>
> https://minati.de./
> (the point after 'de' shouldn't be valid)
>
Yes it is, all hostnames actually end with a . (the DNS root), but  
nobody requires you
to enter it since they all end with one.  There are instances where  
you DO want to
include it though.  For example if your DNS search order includes  
'foo.com', and you
type into your web browser http://www/ it takes you to http:// 
www.foo.com/, but what
happens if your DNS server search order includes foo.com and you have  
a host
named minati.de.foo.com?  Will you go to minati.de or to  
minati.de.foo.com?  To
make sure you get just minati.de and not minati.de.foo.com you can use
http://minati.de./

> On the other hand this url
> https://minati.de/index.html#lkj
> is invalid, but that might be some trubble with the '#' and the  
> encoding.
> (I'm fighting with utf-8 at the moment, do you have experience in  
> that Carl?)
>

 From 'RFC Uniform Resource Identifiers (URI): Generic Syntax'

2.4.3. Excluded US-ASCII Characters

    Although they are disallowed within the URI syntax, we include  
here a
    description of those US-ASCII characters that have been excluded and
    the reasons for their exclusion.

... snip ...

    The character "#" is excluded because it is used to delimit a URI  
from a
    fragment identifier in URI references (Section 4).

Your URI isn't valid unless you encode the #

-- 
Jason Kohles
email at jasonkohles.com
http://www.jasonkohles.com/
"A witty saying proves nothing."  -- Voltaire





More information about the Html-widget mailing list