| ▲ | Python Type Checker Comparison: Empty Container Inference(pyrefly.org) |
| 32 points by ocamoss 4 days ago | 15 comments |
| |
|
| ▲ | Boxxed 2 hours ago | parent | next [-] |
| My favorite part about the type annotations in python is that it steers you into a sane subset of the language. I feel like it's kind of telling that python is this super dynamic language but the type annotations aren't powerful enough to denote all that craziness. |
|
| ▲ | jez 2 hours ago | parent | prev | next [-] |
| A more complicated version of this problem exists in TypeScript and Ruby, where there are only arrays. Python’s case is considerably simpler by also having tuples, whose length is fixed at the time of assignment. In Python, `x = []` should always have a `list[…]` type inferred. In TypeScript and Ruby, the inferred type needs to account for the fact that `x` is valid to pass to a function which takes the empty tuple (empty array literal type) as well as a function that takes an array. So the Python strategy #1 in the article of defaulting to `list[Any]` does not work because it rejects passing `[]` to a function declared as taking `[]`. |
|
| ▲ | loevborg 2 hours ago | parent | prev | next [-] |
| FWIW, Typescript is using Strategy 2: https://www.typescriptlang.org/play/?#code/GYVwdgxgLglg9mABM... I'm a bit confused by the fact that the array starts out typed as `any[]` (e.g. if you hover over the declaration) but then, later on, the type gets refined to `(string | number)[]`. IMO it would be nicer if the declaration already showed the inferred type on hover. |
| |
| ▲ | sheept 30 minutes ago | parent | next [-] | | I agree, it's always been unsettling to see any[] on hover, even though it gets typed in the end. I think one reason might be to allow the type to be refined differently in different code paths. For example: function x () {
let arr = []
if (Math.random() < 0.5) {
arr.push(0)
return arr
} else {
arr.push('0')
return arr
}
}
In each branch, arr is typed as number[] and string[], respectively, and x's return type is number[] | string[]. If it decided to retroactively infer the type of arr at declaration, then I'd imagine x's return type would be the less specific (number | string)[]. | |
| ▲ | bastawhiz 2 hours ago | parent | prev [-] | | It depends on your tsconfig. An empty array could be typed as never[], forcing you to annotate it. | | |
| ▲ | wk_end an hour ago | parent | next [-] | | I don't believe this is correct. There's no settings that correspond to that AFAIK, and it'd actually be quite bad, because you could access the empty array and then get a `never` object, which you're not supposed to be able to do. https://www.typescriptlang.org/play/?#code/GYVwdgxgLglg9mABM... `unknown[]` might be more appropriate as a default, but TypeScript does you one better: with OP's settings, although it's typed as `any[]`, it'll error out if you don't do anything to give it more information because of `noImplicitAny`. | |
| ▲ | loevborg 2 hours ago | parent | prev [-] | | Which setting specifically? Can you repro in the typescript playground? |
|
|
|
| ▲ | IshKebab 2 hours ago | parent | prev | next [-] |
| I think it would be worth mentioning that in normal use (strict mode) Pyright simply requires you to add type annotations to the declaration. Occasionally mildly annoying but IMO it's clearly the best option. |
|
| ▲ | curiousgal 2 hours ago | parent | prev [-] |
| I can't help but find type hints in python to be..goofy? I have a colleague who has a substantial C++ background and now working in python, the code is just littered with TypeAlias, Generic, cast, long Unions etc.. this can't be the way.. |
| |
| ▲ | tialaramex 2 hours ago | parent | next [-] | | Typing is a relatively easy way for the human author and the machine to notice if they disagree about what's going on before problems arise. It is unfortunate that Python doesn't do a good job with types, I was reading earlier today about the mess they made of booleans - their bool type is actually just the integers again. | | |
| ▲ | nubg 2 hours ago | parent [-] | | > I was reading earlier today about the mess they made of booleans Can you elaborate on that? | | |
| ▲ | tialaramex an hour ago | parent | next [-] | | Because Python decided that (for the usual New Jersey reason, simplicity of implementation) bool should just be an integer type the Liskov criterion comes into play. If we can X an integer and we've agreed bool is an integer => we can X a bool. That's not what booleans are but hey, it's sorta close and this was easier to implement. So, can we add two bools together? Adding booleans together is nonsense, but we've said these are a kind of integer so sure, I guess True + True = 2 ? And this cascades into nonsense like ~True being a valid operation in Python and its result is true... | |
| ▲ | IshKebab an hour ago | parent | prev [-] | | He did - booleans are integers: >>> isinstance(False, int)
True
A related screw-up is implicitly casting everything to bool. A lot of languages made that mistake.Overall I'd say they didn't do an awful job though. The main problems with Python are the absolutely abysmal tooling (which thankfully uv fixes), the abysmal performance (which sometimes isn't an issue, but it usually becomes an issue eventually), and the community's attitude to type checking. Actually type checking code you've written yourself with Pyright in strict mode is quite a pleasant experience. But woe betide you if you want to import any third party libraries. There's at least a 50% chance they have no type annotations at all, and often it's deliberate. Typescript used to have a similar problem but the Javascript community realised a lot quicker than the Python community that type hints are a no-brainer. |
|
| |
| ▲ | wiseowise 2 hours ago | parent | prev | next [-] | | What is the way in your opinion? | |
| ▲ | IshKebab 2 hours ago | parent | prev [-] | | I strongly disagree. Python has actually done a decent job of adding type annotations into the language IMO. If you ignore the bit where they don't actually specify their semantics anyway. > this can't be the way.. The alternative is fragile and unmaintainable code. I know which I prefer! |
|