Introduce an ERC-1155 _exists() function #2185

KaiRo-at · 2020-04-14T13:16:27Z

This PR is based on a discussion started at #2003 (comment) and should give the ERC-1155 implementation more compatibility to OpenZeppelin's ERC-721 implementation and also better compatibility to services like OpenSea as pointed to by https://twitter.com/xanderatallah/status/1232124941425881089

The main thing introduced here is an _exists() function that is checked in a few places where we'd otherwise return a null-ish default value (0 or ''), and to know if a token ID is valid, it has to be registered internally first - either via minting a first token on that ID, or by calling an explicit function.

Tests will fail on this PR for the moment as it bases on both #2130 and #2029 - also, I have not written any tests for this PR itself yet as it's mostly a suggestion on how we could do this in a good way.

KaiRo-at · 2020-04-14T13:20:53Z

One thing I somewhat wonder about is if requiring this explicit registration would cause any issues for people who want to use the ERC-1155 implementation from OpenZeppelin.
FWIW, for the project I work on right now, I will use this patch as it should work really well with it.

christopheradams · 2020-04-28T04:11:40Z

I have some reservations about how this matches up with the ERC-1155 spec.

The section on Enumerating from events says that "In order to keep storage requirements light for contracts implementing ERC-1155, enumeration (discovering the IDs and values of tokens) must be done using event logs." Baking the token registration storage mapping into the default contact would seem to violate this principle.

Also, the Metadata Extensions reads "The uri function MUST NOT be used to check for the existence of a token as it is possible for an implementation to return a valid string even if the token does not exist." I'm afraid clients might intentionally or unintentionally treat the presence or absence of an error generated by uri() as a de facto check. Reading the spec, I'm not sure I would expect any of the view functions to ever return errors.

KaiRo-at · 2020-04-28T21:30:48Z

Hrm, that sounds like it's an unusable standard for most usages then as you only get reliable event logs with an archive node which is really expensive to set up and operate, and services like OpenSea need a good way to check which tokens are valid. Our implementation of the project in use right now will use the patch in this PR and then I probably will advocate for people not using ERC-1155 because of its multiple shortcomings, including those two quotes from the spec.

christopheradams · 2020-05-03T01:06:27Z

Just for the record, I believe there are valid use cases for an _exists() function and a mapping to keep track of it. For example, you need this if you want to guarantee that a token can only be minted once. I'm in favor of seeing a good implementation of this, but I don't think it should be the default (requiring contract storage it may not need), unless the spec changes.

nventuro · 2020-05-08T20:39:34Z

Thanks for thos comments @christopheradams! Here's my interpretation of the spec:

The uri function MUST NOT be used to check for the existence of a token as it is possible for an implementation to return a valid string even if the token does not exist.

I think this means callers of uri() should not use it to check existence: that is, the fact that uri(id) returns with no errors doesn't necessarily mean that id exists. Nothing prevents us from adding a check ourselves if we add a notion of a token 'existing'.

In order to keep storage requirements light for contracts implementing ERC-1155, enumeration (discovering the IDs and values of tokens) must be done using event logs.

Again, the specification doesn't forbid implementations from having storage-based enumeration (which would be silly) - rather, it says that enumerating using events only should be possible. I share @KaiRo-at's concerns when it comes to building state from events: both access to archival nodes and building state-aggregating software are not trivial tasks.

I believe on-chain enumeration (sensibly applied) is a good practice, which is why we started introducing concepts such as EnumerableMap to the library, and use it for AccessControl. In that sense, we may even want to go as far as implementing this using EnumerableSet, which would let users not only check for existence but also enumerate all tokens in the system.

nventuro

Once #2029 is merged, we should rebase this PR to keep the diff to a minimum.

nventuro · 2020-05-08T20:43:48Z

contracts/token/ERC1155/ERC1155.sol

+     * @param id uint256 ID of the token to register
+     */
+    function _registerToken(uint256 id) internal virtual {
+        _tokenExists[id] = true;


With EIP2200, an SSTORE here will cost the same as an SLOAD if _tokenExists[id] is already set to true, so this is already optimal in that sense.

We may want to consider using a 32-byte word instead of a boolean however: see the comments here (tl;dr: storing booleans is more costly than full-words because of extra checks placed by the compiler).

Also, from the 1155 EIP:

To broadcast the existence of a token ID with no initial balance, the contract SHOULD emit the TransferSingle event from 0x0 to 0x0, with the token creator as _operator, and a _value of 0.

What do you think about emitting such an event here? We should only do it the first time though, so we'd want to introduce an if statement anyway, even if not required for efficiency reasons.

Ah, that is an interesting piece from the spec, I missed that one! I think it definitely would be a good idea to emit that one.
Also, I didn't even think of the double-registering case, thanks for your thoughts on that.
So, should I change _tokenExists to a uint256 (using 0 and 1 for false and true) instead? I find it somewhat sad that a bool can end up being more expensive than a 32-byte type...

Yes, I'm also not thrilled by that - we run into the same situation on this other PR.

What do you think about using EnumerableSet to store this data, instead of a plain mapping and having to deal with uint vs bool? As mentioned in the other comments, being able to enumerate this data is both important and hard to do via events. It'd mimic 721's tokenByIndex.

Well, an EnumerableSet mostly makes sense for the reverse indexing, which from what I see is completely unneded here and only costs additional gas. Is there any reason for having this really enumerable? Also, it would not change anything wrt bool vs. uint as EnumerableSet only works on bytes32 values, potentially in the variant of UintSet where they are cast to uint256 anyhow.

AC0DEM0NK3Y · 2020-05-08T22:02:50Z

#2185 (comment)

Hi. On the above comment, forgive me if I'm wrong it's been a while since I looked at a node but from what I recall you do not need a archive node to get full history of events you need an archive node if you need full history of the state.
You would need to wait for the events history to be synced also down to the right block on a normal/fast node to get full history for a contract (there are framework functions to query this) but this is way way less time and space than a full/archive node.

So, pulling all balance or uri events is fine from a normal node. Indeed this should be the case if you want to be able to track balances this way too which is way faster than having to hit the node to query.

IIRC the desire to not have something enforced on that front was not to put a gas burden on implementations as it can be inferred from a balance transfer event (either with a "create" style one or an actual transfer) and implementations such as TheSandbox and SkyWeaver did not want to pay this cost.

nventuro · 2020-05-08T22:25:28Z

Thanks for that comment @AC0DEM0NK3Y, it seems I was mistaken when it comes to node types - I believed logs were pruned after some time on regular nodes. Indeed, Infura seems to store all events on a traditional database for fast access via the eth_getLogs RPC call.

However, there seems to be many iniciatives to drop this expectation of nodes to keep logs forever. Logs are meant to be used for applications to react to activity on the blockchain, not as a form of inexpensive read-only off-chain storage. See this great gist from one of Geth lead devs on that topic.

All in all, I think this topic will become an issue in the near future and we should prepare for that. Additionally, my point about state-aggregation still stands.

AC0DEM0NK3Y · 2020-05-08T22:36:11Z

Yeah I could see the desire to prune, although I would hope they would keep a mode on that at least allows you to say "sync the history for X,Y and Z contract from block number" it's something I put as a suggestion on parity features a while ago... something like that would also be desirable for state.

Personally I think what most serious implentations would do is sync your node event history down once, fill a db with events up to head and then just feed the db as events come in to give balance and uri updates (and flag existance).
Hitting the node for every request seems like it will get quite unwieldy.

KaiRo-at · 2020-05-08T23:12:49Z

@nventuro Ah, thanks for the explanations on the spec, I let myself be misled about the interpretation as well after the comment here but re-reading the statements, i agree with you.
On the event log issue, I has some expectations that event logs may be available on fast-pruning nodes but learned the hard way that they are not - while they are not pruned at least by current Parity/OpenEthereum implementations, you only get the logs for any blocks the node has processed itself, so none from before it loaded a snapshot, and none if it may lag and jump to another snapshot (which is rare but can happen during sync or if it was offline for some reason. Even worse, you just get empty responses and no errors if you query for blocks it missed logs for. So we learned you need an archive node to be sure you have a complete event log. Of course, services like Infura or Etherscan probably operate archive nodes and also may copy the data they need into other databases they can query more efficiently.
I'll look into the comments on the code in the next days, I had an intense day and want to be able to concentrate fully when looking at those.

AC0DEM0NK3Y · 2020-05-08T23:24:45Z

You have to wait until it has synced the events history to be sure. Being "fully synced" to head block after catchup from a snapshot is not actually fully synced, you are only guaranteed to have events data from snapshot block at that point after which it will switch to history pull along with keeping up with head.
Both parity and geth aren't very good at signalling this, iirc parity had a single number that you'll see increasing in the log that shows where its currently up to.

To be sure you either have to eye/parse out from the log or you can call a function on the node to query where it is up to. Web3 last time I looked at it didn't expose this well either (but there was an easy workaround to expose the function via an extension if memory serves) but it was a simple call with ethers iirc.

stale · 2020-05-29T03:26:36Z

Hi all!
This Pull Request has not had any recent activity, is it still relevant? If so, what is blocking it? Is there anything we can do to help move it forward?
Thanks!

nventuro · 2020-06-03T14:24:22Z

We intend to release v3.1 of Contracts soon, with initial support for ERC1155. Given that _exists() is not part of the standard, and we don't yet have consensus on some of the low level details, we'll leave this improvement for v3.2.

stale · 2020-06-18T16:33:10Z

Hi all!
This Pull Request has not had any recent activity, is it still relevant? If so, what is blocking it? Is there anything we can do to help move it forward?
Thanks!

KaiRo-at · 2020-06-19T12:04:34Z

Mr. Bot, this is still relevant, I'll look into it after the releases of OpenZeppelin 3.1 and Crypto stamp 2.

stale · 2020-07-04T23:55:05Z

Hi all!
This Pull Request has not had any recent activity, is it still relevant? If so, what is blocking it? Is there anything we can do to help move it forward?
Thanks!

KaiRo-at · 2020-07-05T20:24:06Z

I'll come back to this soon, my dear bot.

frangio · 2020-07-08T20:03:06Z

There seem to be multiple things being discussed here and I think we need to define more clearly what we want and why.

Adding an exists or _exists function.
Whether the view functions should revert unless a token exists.
Whether there should be enumerability.

And I'd add whether any of this should be enabled by default in our ERC1155.

Another option would be to make sure exists can be implemented by deriving ERC1155 (e.g. using hooks) and that it does not require forking.

Based on the tweet by OpenSea, I think adding an exists function makes sense. I wonder how much gas overhead it would introduce, but it sounds like most of it would only be incurred during minting, which I think would be acceptable.

Based on the concerns expressed by the EIP document and in the comments in this issue, I'd say the view functions should not revert.

I would oppose enumerability by default, but I would consider it as an optional extension. This whole discussion about enumerability on-chain versus doing it based on events is something that we are often unsure about, though, and I think we need to have a deeper discussion about it.

nventuro · 2020-07-09T04:14:45Z

Based on the concerns expressed by the EIP document and in the comments in this issue, I'd say the view functions should not revert.

Which functions are these? balanceOf and isApprovedForAll?

frangio · 2020-07-09T13:28:45Z

And I think also url().

stale · 2020-07-25T03:04:04Z

Hi all!
This Pull Request has not had any recent activity, is it still relevant? If so, what is blocking it? Is there anything we can do to help move it forward?
Thanks!

KaiRo-at · 2020-07-29T14:10:24Z

I'm actually starting to look into this again now and hope to have a new PR this week.

…ntation and satisfy https://twitter.com/xanderatallah/status/1232124941425881089 Use an explicit registration internally to mark token IDs as existing per comments from nventuro, add an event to flag the existence of this token ID

…n other functions

KaiRo-at · 2020-07-29T15:02:08Z

OK, I rebased to current master and updated the work somewhat, no tests yet, but would be great to know if that is the right direction to go now.

stale · 2020-08-14T09:35:47Z

Hi all!
This Pull Request has not had any recent activity, is it still relevant? If so, what is blocking it? Is there anything we can do to help move it forward?
Thanks!

KaiRo-at · 2020-08-14T10:46:39Z

I'm waiting for feedback on the current approach and for time to work on tests.

frangio

@KaiRo-at It's looking pretty good I think. I like the direction, it's a very simple addition.

frangio · 2020-08-19T00:10:06Z

contracts/token/ERC1155/ERC1155.sol

@@ -31,6 +31,9 @@ contract ERC1155 is Context, ERC165, IERC1155, IERC1155MetadataURI {
    // Used as the URI for all token types by relying on ID substition, e.g. https://token-cdn-domain/{id}.json
    string private _uri;

+    // Mapping token ID to that token being registered as existing (1 for existing, 0 for not existing)
+    mapping (uint256 => uint256) private _tokenExists;


Why is this uint256 => uint256 instead of uint256 => bool?

That's because of #2185 (comment) - do you think that's a bad idea?

frangio · 2020-08-19T00:20:49Z

What do you think about making exists public?

There is a potential issue related to upgradeable ERC1155 contracts. If contract is upgraded from the current ERC1155 version to the one in this PR, exists will return incorrect values because the mapping will not be populated with the already existing contracts. What do you think about that? Is there anything that could be done, other than exposing an external function to allow already existing tokens to be registered?

KaiRo-at · 2020-09-16T16:24:55Z

What do you think about making exists public?

I mainly followed the pattern we also have in the ERC721 OpenZeppelin contract, leaving it to implementers to expose it publicly. But I'm open to making it public right away.

There is a potential issue related to upgradeable ERC1155 contracts. If contract is upgraded from the current ERC1155 version to the one in this PR, exists will return incorrect values because the mapping will not be populated with the already existing contracts. What do you think about that? Is there anything that could be done, other than exposing an external function to allow already existing tokens to be registered?

I haven't thought about upgradeable contracts before as I usually don't do them, but yes, in this case, I think, yes, they would need to call _registerToken(uint256 id) with the IDs that have been created before.

stale · 2020-10-17T20:26:25Z

Hi folks!
This Pull Request is being closed as there was no response to the previous prompt. However, please leave a comment whenever you're ready to resume, so it can be reopened.
Thanks again!

KaiRo-at changed the title ~~Erc1155 exists~~ Introduce an ERC-1155 _exists() function Apr 14, 2020

KaiRo-at mentioned this pull request Apr 14, 2020

Improve extensibility of ERC1155 #2003

Closed

KaiRo-at force-pushed the erc1155-exists branch from 9af78df to 8bbca1d Compare April 14, 2020 13:32

nventuro mentioned this pull request Apr 20, 2020

Contracts Roadmap Q2 2020 #2207

Closed

nventuro reviewed May 8, 2020

View reviewed changes

stale bot added stale and removed stale labels May 29, 2020

stale bot added the stale label Jun 18, 2020

nventuro removed the stale label Jun 19, 2020

stale bot added the stale label Jul 4, 2020

stale bot removed the stale label Jul 5, 2020

stale bot added the stale label Jul 25, 2020

stale bot removed the stale label Jul 29, 2020

KaiRo-at added 2 commits July 29, 2020 16:55

rework _exists() and _registerToken() slightly, do not require() it i…

4d530c5

…n other functions

KaiRo-at force-pushed the erc1155-exists branch from 59315b5 to 4d530c5 Compare July 29, 2020 15:00

KaiRo-at changed the base branch from feature-erc1155 to master July 29, 2020 15:03

stale bot added the stale label Aug 14, 2020

stale bot removed the stale label Aug 14, 2020

frangio reviewed Aug 19, 2020

View reviewed changes

This comment has been minimized.

Sign in to view

stale bot added the stale label Sep 4, 2020

stale bot removed the stale label Sep 16, 2020

This comment has been minimized.

Sign in to view

stale bot added the stale label Oct 2, 2020

stale bot closed this Oct 17, 2020

frangio mentioned this pull request Feb 24, 2021

ERC1155Supply, an extension that keep track of totalSupply for ERC1155 tokens #2536

Closed

Amxx mentioned this pull request Mar 15, 2021

Introduce ERC1155 totalSupply() and exists() functions #2593

Merged

3 tasks

Introduce an ERC-1155 _exists() function #2185

Introduce an ERC-1155 _exists() function #2185

Conversation

KaiRo-at commented Apr 14, 2020

KaiRo-at commented Apr 14, 2020

christopheradams commented Apr 28, 2020

KaiRo-at commented Apr 28, 2020

christopheradams commented May 3, 2020

nventuro commented May 8, 2020

nventuro left a comment

Choose a reason for hiding this comment

nventuro May 8, 2020

Choose a reason for hiding this comment

nventuro May 8, 2020

Choose a reason for hiding this comment

KaiRo-at May 11, 2020

Choose a reason for hiding this comment

nventuro May 12, 2020 • edited Loading

Choose a reason for hiding this comment

KaiRo-at May 29, 2020

Choose a reason for hiding this comment

AC0DEM0NK3Y commented May 8, 2020

nventuro commented May 8, 2020

AC0DEM0NK3Y commented May 8, 2020 • edited Loading

KaiRo-at commented May 8, 2020 • edited Loading

AC0DEM0NK3Y commented May 8, 2020 • edited Loading

stale bot commented May 29, 2020

nventuro commented Jun 3, 2020 • edited Loading

stale bot commented Jun 18, 2020

KaiRo-at commented Jun 19, 2020

stale bot commented Jul 4, 2020

KaiRo-at commented Jul 5, 2020

frangio commented Jul 8, 2020 • edited Loading

nventuro commented Jul 9, 2020

frangio commented Jul 9, 2020

stale bot commented Jul 25, 2020

KaiRo-at commented Jul 29, 2020

KaiRo-at commented Jul 29, 2020

stale bot commented Aug 14, 2020

KaiRo-at commented Aug 14, 2020

frangio left a comment • edited Loading

Choose a reason for hiding this comment

frangio Aug 19, 2020 • edited Loading

Choose a reason for hiding this comment

KaiRo-at Sep 16, 2020

Choose a reason for hiding this comment

frangio commented Aug 19, 2020

This comment has been minimized.

KaiRo-at commented Sep 16, 2020

This comment has been minimized.

stale bot commented Oct 17, 2020

nventuro May 12, 2020 •

edited

Loading

AC0DEM0NK3Y commented May 8, 2020 •

edited

Loading

KaiRo-at commented May 8, 2020 •

edited

Loading

AC0DEM0NK3Y commented May 8, 2020 •

edited

Loading

nventuro commented Jun 3, 2020 •

edited

Loading

frangio commented Jul 8, 2020 •

edited

Loading

frangio left a comment •

edited

Loading

frangio Aug 19, 2020 •

edited

Loading