← Back

Domain Overview: gigaword.dk

Domain Information

FieldValue
ID 93351
Domain gigaword.dk
Title Danish Gigaword
Meta Description Danish Gigaword Corpus: A billion words of Danish, in one open and free dataset
Headings Danish Gigaword Introduction Download Documentation License & Reference Models using Danish Gigaword Tools using Danish Gigaword Press Coverage Contact Credits
Structured Data
[
  {
    "@context": "https://schema.org",
    "@type": "WebSite",
    "url": "https://gigaword.dk"
  },
  {
    "@context": "https://schema.org",
    "@type": "Project",
    "@id": "https://gigaword.dk",
    "name": "Danish Gigaword",
    "logo": "https://gigaword.dk/images/icon_hua9bb42c2605399adb6592c77a58560a5_36603_192x192_fill_lanczos_center_2.png",
    "url": "https://gigaword.dk"
  },
  {
    "@context": "http://schema.org",
    "@type": "Dataset",
    "name": "Danish Gigaword",
    "description": "A billion-word corpus of Danish text, freely distributed with attribution.",
    "spatialCoverage": "Denmark",
    "version": 1.0,
    "license": "https://creativecommons.org/licenses/by/4.0/",
    "distribution": {
      "@type": "DataDownload",
      "contentUrl": "https://bit.ly/danishgigaword10"
    },
    "sourceOrganization": "IT University of Copenhagen"
  }
]
Internal Link(s)
Show all 6 links
External Link(s)
Show all 22 links
Found In demo.sun.dictus.dk
HTTP Status 200
IP Address 35.157.26.135
Response Time 5.12855
Server Header Netlify
Page Load Speed 5.12855
Favicon URL http://gigaword.dk/images/icon_hua9bb42c2605399adb6592c77a58560a5_36603_32x32_fill_lanczos_center_2.png
WHOIS Status Active
Created Date 19. februar 2023
Registrant Name Leon Richard Anthony Strømberg-Derczynski
Registrant Address Tæbyvej 14
Registrant City 2610 Rødovre
Registrant Country Danmark
Registrant Phone -
Registrar None
Nameservers ["dns1.p05.nsone.net", "dns2.p05.nsone.net", "dns3.p05.nsone.net", "dns4.p05.nsone.net"]
Registered 2023-02-19
Expires 2026-02-18
Crawl Timestamp 2025-07-21 02:36:25

Inbound Links (Total: 0)

IDSource DomainSource PageLink URLDiscovered AtLast Active

Outbound Links (Total: 0)

IDDestination DomainDestination PageLink URLDiscovered AtLast Active

Historical Snapshots

2025-05-28 2025-05-11 2025-04-27 2025-04-17 2025-04-08

Historical Domain Information

FieldValue
ID 93351
Domain gigaword.dk
Title Danish Gigaword
Meta Description Danish Gigaword Corpus: A billion words of Danish, in one open and free dataset
Headings Danish Gigaword Introduction Download Documentation License & Reference Models using Danish Gigaword Tools using Danish Gigaword Press Coverage Contact Credits
Structured Data
["{\"@context\":\"https://schema.org\",\"@type\":\"WebSite\",\"url\":\"https://gigaword.dk\"}", "{\"@context\":\"https://schema.org\",\"@type\":\"Project\",\"@id\":\"https://gigaword.dk\",\"name\":\"Danish Gigaword\",\"logo\":\"https://gigaword.dk/images/icon_hua9bb42c2605399adb6592c77a58560a5_36603_192x192_fill_lanczos_center_2.png\",\"url\":\"https://gigaword.dk\"}", "{\"@context\":\"http://schema.org\",\"@type\":\"Dataset\",\"name\":\"Danish Gigaword\",\"description\":\"A billion-word corpus of Danish text, freely distributed with attribution.\",\"spatialCoverage\":\"Denmark\",\"version\":\"1.0\",\"license\":\"https://creativecommons.org/licenses/by/4.0/\",\"distribution\":{\"@type\":\"DataDownload\",\"contentUrl\":\"https://bit.ly/danishgigaword10\"},\"sourceOrganization\":\"IT University of Copenhagen\"}"]
Internal Link(s) ["https://gigaword.dk", "https://gigaword.dk", "https://gigaword.dk/", "https://gigaword.dk/", "http://gigaword.dk", "http://gigaword.dk"]
External Link(s) ["https://en.itu.dk/about-itu/press/news-from-itu/2021/itu-led-project-will-make-automated-translation-more-reliable", "https://bit.ly/danishgigaword10", "https://aclanthology.org/2021.nodalida-main.46/", "https://creativecommons.org/licenses/by/4.0/", "https://huggingface.co/Maltehb/aelaectra-danish-electra-small-cased", "https://ogtal.dk", "https://www.sketchengine.eu/danish-gigaword-corpus/?s=", "https://www.dr.dk/nyheder/viden/teknologi/heste-nettet-kan-blive-grundlag-kunstig-intelligens-paa-dansk", "https://www.dr.dk/lyd/special-radio/prompt/prompt-2023/prompt-hestenet-toerstige-prompts-og-chatbot-der-kan-hoere-og-se-11802321008", "https://www.bloomberg.com/news/newsletters/2023-09-22/danish-ai-trained-on-data-from-a-web-forum-about-horses", "https://www.heste-nettet.dk/nyheder/15285/", "https://borsen.dk/nyheder/opinion/debat-staten-investerer-stort-i-kunstig-intelligens-men-rammer-forbi-skiven", "https://investindk.com/insights/denmark-to-strenghten-opportunities-for-nlp-businesses", "https://jack-clark.net/2021/06/07/import-ai-252-gait-surveillance-a-billion-danish-words-deepmind-makes-phone-using-agents/", "https://sprogteknologi.dk/blog/danish-gigaword-project-et-historisk-stort-dansk-tekstkorpus", "https://en.itu.dk/about-itu/press/news-from-itu/2021/itu-led-project-will-make-automated-translation-more-reliable", "https://politiken.dk/indland/art8205046/Superalgoritme-kortl%C3%A6gger-det-danske-had-og-afsl%C3%B8rer-yndlingsofrene-p%C3%A5-Facebook", "https://www.kmd.dk/presse/pressemeddelelser-og-nyheder/sprogmodellen-aelaectra-vil-forbedre-dansk-sprogteknologi-paa-en-klimavenlig-maade", "https://www.morningbrew.com/emerging-tech/stories/2021/03/29/one-biggest-advancements-ai-also-sparked-fierce-debate-heres", "https://www.pexels.com/photo/aerial-photo-of-beach-3596017/", "https://wowchemy.com/?utm_campaign=poweredby", "https://github.com/wowchemy/wowchemy-hugo-modules"]
Found In -
HTTP Status 200
IP Address 3.124.100.143
Response Time 4.01062
Server Header Netlify
Page Load Speed 4.01062
Favicon URL http://gigaword.dk/images/icon_hua9bb42c2605399adb6592c77a58560a5_36603_32x32_fill_lanczos_center_2.png
WHOIS Status Active
Created Date 19. februar 2023
Registrant Name Leon Richard Anthony Strømberg-Derczynski
Registrant Address Tæbyvej 14
Registrant City 2610 Rødovre
Registrant Country Danmark
Registrant Phone -
Registrar None
Nameservers ["dns1.p05.nsone.net", "dns2.p05.nsone.net", "dns3.p05.nsone.net", "dns4.p05.nsone.net"]
Registered 2023-02-19
Expires 2026-02-18
Crawl Timestamp 2025-04-17 18:23:13