[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

NAT64 and matching identities



So it turns out that in many cases a missing www is causing the "no IPv4"
response, and someone from Alexa does need to explain what is going on. For
the entire top-1m.csv file, 35,554 entries returned "no IPv4". For each
entry in the csv file;
some return NXDOMAIN:
discart.ru  -->  NXDOMAIN
     www.discart.ru resolves, but Alexa file missing www.
     Alexa position:            4721,discart.ru

some return without any answer:
bp.blogspot.com  --> No Answer
     www.bp.blogspot.com resolves, but Alexa file missing www.
     Alexa position:            87,bp.blogspot.com

while others point to MX-only entries:
akamaihd.net  -->  MX-only
     www.akamaihd.net resolves, but Alexa file missing www.
     Alexa position:            74,akamaihd.net

The version of the code I have been using strips the www. and tries again,
but obviously it also needs to add the www. and retry. In any case, the
Alexa file points to names that do not serve web content, so the entire 'top
1M' list is suspect.

Tony


> -----Original Message-----
> From: Tony Hain [mailto:alh-ietf at tndh.net]
> Sent: Friday, November 22, 2013 3:50 PM
> To: 'Owen DeLong'
> Cc: Sherfesee at amazon.com; 'NANOG List'
> Subject: RE: NAT64 and matching identities
> 
> Someone from Alexa really needs to answer how that list is created because
> their web site discussion is way too hand-wavy, but given that neither of
> those appear to be currently valid names, and 1.1.1.1 is on the list at
all, there
> must be some measure of cross link and redirection occurrences. For the
> entire top-1m, I show today's file has 2815 as dotted-quad. In the top
> 50,000 there are 1790 with "no IPv4   no IPv6".  Clearly they don't bother
> to prune the list for validity. ~4% of the next 25,000 names are dead
(50,000-
> 75,000), and one can only guess that as you get further down the list the
> percentage of dead names will continue to go up. I have a full 1M run in
> process, but would not count on it completing before Monday.
> 
> Just to add a level of 'extra effort' to the process, I increased the
number of
> attempts to 10, and the time between attempts to 10 seconds. With that,
> dead names in the top 1000:
> akamaihd.net                    no IPv4   no IPv6
> bp.blogspot.com                 no IPv4   no IPv6
> delta-search.com                no IPv4   no IPv6
> bannersdontwork.com             no IPv4   no IPv6
> cloudfront.net                  no IPv4   no IPv6
> doorblog.jp                     no IPv4   no IPv6
> uimserv.net                     no IPv4   no IPv6
> linksynergy.com                 no IPv4   no IPv6
> lipixeltrack.com                no IPv4   no IPv6
> australianbrewingcompany.com    no IPv4   no IPv6
> searchfun.in                    no IPv4   no IPv6
> greatappsdownload.com           no IPv4   no IPv6
> klikbca.com                     no IPv4   no IPv6
> jobfindgold.info                no IPv4   no IPv6
> adnxs.com                       no IPv4   no IPv6
> rakuten.ne.jp                   no IPv4   no IPv6
> sweetpacks-search.com           no IPv4   no IPv6
> yomiuri.co.jp                   no IPv4   no IPv6
> incredibar-search.com           no IPv4   no IPv6
> searchgol.com                   no IPv4   no IPv6
> livedoor.biz                    no IPv4   no IPv6
> workercn.cn                     no IPv4   no IPv6
> 
> FWIW: in the top 50,000, I show 1525 "has IPv4  has IPv6" &  0  "no IPv4
has
> IPv6". In other words, there are more dead names than there are AAAA
> records, and there are not any IPv6-only sites in that group.
> 
> Tony
> 
> 
> > -----Original Message-----
> > From: Owen DeLong [mailto:owen at delong.com]
> > Sent: Friday, November 22, 2013 1:48 PM
> > To: Tony Hain
> > Cc: joel jaeggli; Valdis.Kletnieks at vt.edu; NANOG List
> > Subject: Re: NAT64 and matching identities
> >
> > So one has to wonder how those names made it into the top 100 list if
> > it's supposed to be a top 100 web sites, since they are obviously not
> > web
> sites.
> > (at least in the case of the two in the top 100)
> >
> > Owen
> >
> > On Nov 22, 2013, at 1:28 PM, Tony Hain <alh-ietf at tndh.net> wrote:
> >
> > > The only thing it explicitly strips out are dotted-quads, which
> > > don't occur until # 4255. The code makes five passes at
> > > getaddrinfo() for
> > > IPv4 before giving up, and then it checks for a leading www and if
> > > that exists it strips it off and does the 5 tries loop again, then
> > > later the same process for IPv6. For the top 100 run:
> > > akamaihd.net                    no IPv4   no IPv6
> > > bp.blogspot.com                 no IPv4   no IPv6
> > >
> > > FWIW :::
> > > Dotted-quad's in the top 10,000
> > > 4255,92.242.195.24
> > > 4665,1.1.1.1
> > > 5079,92.242.195.231
> > > 6130,1.254.254.254
> > > 9518,208.98.30.70
> > >
> > >> whois 92.242.195.24
> > > ...
> > > netname:        Respina
> > > descr:          BroadBand IP Pool
> > > country:        IR
> > > ...
> > > route:          92.242.195.0/24
> > >
> > > Respina BroadBand IP Pool in the top 100,000
> > > 4255,92.242.195.24
> > > 5079,92.242.195.231
> > > 10059,92.242.195.233
> > > 23912,92.242.195.30
> > > 31520,92.242.195.111
> > > 35867,92.242.195.235
> > > 95233,92.242.195.129
> > >
> > >
> > >> -----Original Message-----
> > >> From: Owen DeLong [mailto:owen at delong.com]
> > >> Sent: Friday, November 22, 2013 12:16 PM
> > >> To: joel jaeggli
> > >> Cc: Valdis.Kletnieks at vt.edu; Tony Hain; NANOG List
> > >> Subject: Re: NAT64 and matching identities
> > >>
> > >> It would be way more than 2 if it were CNAME, methinks.
> > >>
> > >> Owen
> > >>
> > >> On Nov 22, 2013, at 12:12 PM, joel jaeggli <joelja at bogus.com> wrote:
> > >>
> > >>> On 11/22/13, 12:01 PM, Valdis.Kletnieks at vt.edu wrote:
> > >>>> On Fri, 22 Nov 2013 10:18:27 -0800, "Tony Hain" said:
> > >>>>
> > >>>>> The top 100 websites: AAAA records and IPv6 connectivity
> > >>>>>          count with A:       98   ( 98.000%)
> > >>>>>       count with AAAA:       30   ( 30.000%)
> > >>>>> Of the 30 hosts with AAAA records, testing connectivity to TCP/80:
> > >>>>>    count with IPv6 ok:       30   (100.000%)
> > >>>>
> > >>>> Statistics whoopsie, or are there actually 2 sites in the top100
> > >>>> that are IPv6-only?
> > >>>
> > >>> IN CNAME ? or is that being accounted for.
> > >>>
> > >>>
> > >>>
>