Абстрактна Вікіпедія/Оновлення/2022-04-01

Оновлення Абстрактної Вікіпедії

Типізовані списки дуууже довгі

Сьогоднішній інформаційний бюлетень буде зосереджений на технічних деталях, і ви можете пропустити його, якщо не хочете занурюватися в заплутані деталі нашої моделі даних.

Протягом останніх кількох тижнів ми неодноразово стикалися з перешкодою, але ще не усунули її, а саме, як представити типізований список, щоб представлення не ставало дуже довгим. Список у Вікіфункціях — це тип, який містить невідому кількість елементів у певному порядку.

In the Wikifunctions data model, a list is represented the same way as it is in the programming language Lisp: a list consists of two parts, a “head”, which is the first element of the list, and a “tail”, which itself is a list and holds the rest of the list. In addition, there is the empty list, which has no head or tail and represents a list with zero elements. In Wikifunctions, the type for a list is Z10/List.

So a list with two elements "a" and "b" would, represented in our usual JSON notation, look like this:

{ "type": "<span lang="en" dir="ltr" class="mw-content-ltr">List</span>", "head": "a" "tail": { "type": "<span lang="en" dir="ltr" class="mw-content-ltr">List</span>", "head": "b", "tail": { "type": "<span lang="en" dir="ltr" class="mw-content-ltr">List</span>" } } }	{ "Z1K1": "Z10", "Z10K1": "a" "Z10K2": { "Z1K1": "Z10", "Z10K1": "b", "Z10K2": { "Z1K1": "Z10" } } }

{
  "type": "<span lang="en" dir="ltr" class="mw-content-ltr">List</span>",
  "head": "a"
  "tail": {
    "type": "<span lang="en" dir="ltr" class="mw-content-ltr">List</span>",
    "head": "b",
    "tail": {
      "type": "<span lang="en" dir="ltr" class="mw-content-ltr">List</span>"
    }
  }
}

{
  "Z1K1": "Z10",
  "Z10K1": "a"
  "Z10K2": {
    "Z1K1": "Z10",
    "Z10K1": "b",
    "Z10K2": {
      "Z1K1": "Z10"
    }
  }
}

As this gets a bit long, we have introduced a shortcut by using JSON’s array notation, and so the list can be represented as follows:

[ "a", "b" ]

The short notation (we call it canonicalized) can be easily and mechanically translated into the longer notation seen above, and the other way around.

In Phase η we introduced the possibility to have typed lists. Whereas in the normal list we don’t say anything about the type of the elements of the list, in a typed list we say “every element of the list must be of a certain type”. Having typed lists allows us to have stronger guarantees on typing: if you ask for an element of a typed list, you know what type the result will be. This can be helpful with type checking and with providing better interfaces.

A typed list is the result of a function call. We have a function, Z881/Typed list, that takes a type as its sole argument, and returns a type. The resulting type is a copy of Z10/List, with the difference that instead of the head having a value of any type, it must have a value of the type provided as the argument. And the tail is not of type Z10/List, but of the type returned by calling Z881/Type list with the given argument type.

Let’s say we want to represent the list we had before, but instead of it being a Z10/List it should be a list of strings. A list of strings is a type which is created by calling the function Z881/Typed list on the type Z6/String. This would look as follows in JSON notation.

{ "type": { "type": "<span lang="en" dir="ltr" class="mw-content-ltr">Function call</span>", "function": "<span lang="en" dir="ltr" class="mw-content-ltr">Typed list</span>", "argument type": "<span lang="en" dir="ltr" class="mw-content-ltr">String</span>" }, "head": "a" "tail": { "type": { "type": "<span lang="en" dir="ltr" class="mw-content-ltr">Function call</span>", "function": "<span lang="en" dir="ltr" class="mw-content-ltr">Typed list</span>", "argument type": "<span lang="en" dir="ltr" class="mw-content-ltr">String</span>" }, "head": "b", "tail": { "type": { "type": "<span lang="en" dir="ltr" class="mw-content-ltr">Function call</span>", "function": "<span lang="en" dir="ltr" class="mw-content-ltr">Typed list</span>", "argument type": "<span lang="en" dir="ltr" class="mw-content-ltr">String</span>" } } } }	{ "Z1K1": { "Z1K1": "Z7", "Z7K1": "Z881", "Z881K1": "Z6" }, "K1": "a" "K2": { "Z1K1": { "Z1K1": "Z7", "Z7K1": "Z881", "Z881K1": "Z6" }, "Z10K1": "b", "Z10K2": { "Z1K1": { "Z1K1": "Z7", "Z7K1": "Z881", "Z881K1": "Z6" } } } }

{
  "type": {
    "type": "<span lang="en" dir="ltr" class="mw-content-ltr">Function call</span>",
    "function": "<span lang="en" dir="ltr" class="mw-content-ltr">Typed list</span>",
    "argument type": "<span lang="en" dir="ltr" class="mw-content-ltr">String</span>"
  },
  "head": "a"
  "tail": {
    "type": {
      "type": "<span lang="en" dir="ltr" class="mw-content-ltr">Function call</span>",
      "function": "<span lang="en" dir="ltr" class="mw-content-ltr">Typed list</span>",
      "argument type": "<span lang="en" dir="ltr" class="mw-content-ltr">String</span>"
    },
    "head": "b",
    "tail": {
      "type": {
        "type": "<span lang="en" dir="ltr" class="mw-content-ltr">Function call</span>",
        "function": "<span lang="en" dir="ltr" class="mw-content-ltr">Typed list</span>",
        "argument type": "<span lang="en" dir="ltr" class="mw-content-ltr">String</span>"
      }
    }
  }
}

{
  "Z1K1": {
    "Z1K1": "Z7",
    "Z7K1": "Z881",
    "Z881K1": "Z6"
  },
  "K1": "a"
  "K2": {
    "Z1K1": {
      "Z1K1": "Z7",
      "Z7K1": "Z881",
      "Z881K1": "Z6"
    },
    "Z10K1": "b",
    "Z10K2": {
      "Z1K1": {
        "Z1K1": "Z7",
        "Z7K1": "Z881",
        "Z881K1": "Z6"
      }
    }
  }
}

The challenge is that we cannot have the same short notation for this, as we would lose information.

We have tried to work around this by guessing the type: we look into the list, and if everything has the same type, we declare it a typed list. But first, it adds quite a bit of complexity to the code, and second it still does not solve all issues: the short form of the above list could indeed be a list of objects, or it could be a list of strings - there is no way for us to know.

We are publishing our internal discussion and a few options, and we would like to hear your thoughts and gather some wider input.