Request for Comment: Data Economy Index (DATA) Updates

So basically you picked 25% max because so does DeFI Pulse, even though
that means they clipped UNI to that level during a time when DPI FAILED to
keep up with the benchmark ETH HODL strategy? Seems a steep price to pay?
Why not 33% or even 40%?

Thank you for the thoughtful comment, @Crypto_Texan ! Glad to hear you like the idea of the DATA index - now, let’s see if we can persuade you that this is the right implementation :grinning:

Kiba and I received feedback in the original Data Economy Index proposal that we should consider wrapped and derivative tokens to include assets like Filecoin, Helium, Arweave, and Akash Network that are not natively ERC-20 tokens.

The Index Coop is clearly already comfortable using wrapped tokens given that we already have two products using Wrapped Bitcoin (WBTC, BED and BTC2x-FLI.

Still, your point is duly noted that we definitely need to further research and understand the trade-offs of using assets like RENFIL backed by RenVM / Ren Protocol before an IIP is submitted.

When I first joined the Index Cooperative, I remember thinking it was very strange that Augur was initially included in DPI and not Numeraire. After careful consideration, I don’t think either belong in that index because they do not fit into either lending (i.e. AAVE, MKR, COMP, KNC, CREAM) or exchange (i.e. UNI, SUSHI, LRC, BAL) categories which currently dominate the DeFi industry and therefore the DeFi Pulse Index.

Numeraire is one the quintessential tokens in the Data Economy category because it’s value comes from incentivizing data scientists to submit models/predictions (i.e. off-chain data providers) based on private, encrypted datasets (off-chain data & data providers) to run the ultimate hedge fund. I do not think Messari or CoinGecko are properly categorizing Numeraire; CoinMarketCap is closer to the mark by categorizing it as AI & Big Data.

For what it’s worth, I was introduced to Crypto through the Numerai competitions in which I competed as a data scientist. NMR was actually the first cryptoasset I ever owned.

I think there is definitely potential for a “Web3” style index at the Index Coop, although that is not what we are trying to achieve with the Data Economy index at this time.

In general, I would be extremely hestitant to rely on categorization from Messari, CoinGecko, CoinMarketCap, etc. for building an index. We used several categories to create an initial group of tokens for consideration in the Data Economy index and then designed a token inclusion criteria that we believe captures the theme we defined.

In my view, a huge part of the value methodologists bring to the Index Coop is category definition. There is massive value in creating and defining a new category, and this is what @Kiba and I have sought to do.

For example, note that DeFi Pulse Index does not rely on CoinGecko or Messari to determine what is a DeFi token/asset/project. The folks at DeFi Pulse had a huge part in defining DeFi as a category (i.e. the TVL metric comes from DeFi Pulse) and they are reaping massive rewards for both themselves and the Index Cooperative as a result.

1 Like

@TJB2K I’m not sure I understand the point you are making.

Do you think DPI underperforming ETH in USD-terms over a relatively short time period invalidates DPI’s methodological choice to cap individual assets at 25% after the rebalancing phase? I do not think it does.

Yes, a 25% cap is arbitrary but so is 33% or 40% cap. How are these better alternatives? If anything, they allow the index to become further concentrated in a handful of assets, an undesirable feature for an index product.

Do you think DPI methodology should be changed to have a 33% of 40% cap for individual tokens? I don’t think it should. The point of an index is broad exposure to a theme/industry/category/etc., not to concentrate in an individual asset.

Aside from the max cap for individual tokens, what do you think about the proposed product?

1 Like

Hey Thomas, thanks for incorporating comments from everyone into this next iteration of the proposal. I certainly like the distribution of weights between tokens and no significant concentration of positions.

I generally like the simplification of the token weight methodology to a simple market cap weight with a 25% single position cap. One comment around liquidity. I think that now that you’ve increased the number of tokens in the portfolio, you can actually have a liquidity weight component that will not materially advantage LINK but might reduce weight to some of the less liquid tokens.

One of the reasons to consider the liquidity weight component - let’s say DATA goes above $5m in AUM in the first month (it probably will but depends on market conditions and any incentives). Then you have to manually and subjectively decide what to do with RENFIL weight, all while managing a very tricky rebalance. Adding liquidity weight (at 25% but you can model others as well) would help push that point a bit further. Can experiment with square root of liquidity weight as well, if LINK advantage is too much.

Even if you don’t use liquidity weight, it might help to think through and outline a process for dealing with the above situation. So, at the very least, the buyers know what you going to do or which steps you are going to follow to arrive at a decision.


We’ve already scheduled a call with the renFIL team to discuss improving liquidity. Doesn’t prevent the problem going forward with other tokens so might be worth considering.


Great revision from the initial post, the name $DATA is very clear and catchy.


I am happy to agree that 25% is arbitrary, and based on a third party
decision. Raising the arbitrary cap to, say 33% would still allow for
diversification, since most of the benefit from diversification comes in
the first 8-10 tokens. The more important issue is how to do
re-balancing/re-weighting when/if new tokens enter the index, and how often
this takes place. If you expect lots of new hot tokens to enter over the
next year, then a higher arbitrary cap allows more exposure to the leader
in the short term, and ability to add new entrants later. A lower
arbitrary cap at the beginning means less exposure to the market leader in
the short term. It depends on how dynamic you expect the index to be.
Given that all of this is outsourced in the case of DPI to Defi pulse, I
guess it is up to them.

1 Like

Happy to hear that you like the changes we made for this iteration of the proposal!

This is excellent feedback - will absolutely do this before submitting a formal IIP.

@TJB2K I understand your point now.

I think raising the cap on any individual token is one way to address your concern. Changing the rebalancing period from monthly to quarterly would be another (perhaps better?) way to allow more exposure to tokens that have been performing well in the short-term. For instance, most TradFi index products like the S&P 500 rebalance quarterly, not monthly. I suspect DPI started with monthly rebalancing because of all the new DeFi projects they wanted the index to capture quickly (Uniswap was not even in the index at launch).

What do you think?

1 Like

Great work on this @Thomas_Hepner & @Kiba - love all the thought and effort that has been put into incorporating the community feedback and improving the product. So just want to say out of the gate that I really like this product - would buy for me.

A few points of feedback:

So this is the very core of the theme behind DATA. I love the theme. However, I can see how this can feel like a grey area to a lot of people, and when it comes to indexes, people generally want pretty strict lines drawn that make it easy to trust what would or wouldn’t be included in the index. How do you see this? How do you plan on maintaining the line of “data-based services or products”?

I think something like this ^ would go a long way in solidifying the theme that you are trying to achieve in the communities and investors eyes.

I agree with @verto0912 - it would be great to see the liquidity weight modeled here. I would hate for this product to be hamstrung from the beginning due to liquidity issues!

And I think my broader feedback for the community is that I think we have an opportunity here to be a first-mover on a really large theme!


Yes…I think 25% max cap is too low for a brand new index in fast moving DeFi…33% allows plenty of space for new entrants without eroding the share of the likely future dominant winner…DPI capping UNI at 25% is not a good move IMHO

I’m in favor of the proposal though the term “data economy” isn’t a term that I hear often and would suggest naming it “Big Data Economy Index”, where Big Data is a more popular/Googled term and likely to improve marketing impact.


Great work! I think I cannot comment on anything as my concerns were addressed. I just wanted to give a big hand for the hard work and for including a non-native erc20 token on the index.


Not much to add here other than saying i’m on board with this DATA index big time! the product (or the appeal of it) wasn’t initially clear to me but this post patched those gaps in knowledge up pretty immediately.


@Kiba @Thomas_Hepner


Yes the naming could be much stronger and more memetic. Open to any and all suggestions. Dweb, web3, etc. Are more popular terms in crypto and big data, data economy, etc. is popular in enterprise world.


This has already been a topic internally when looking to add more tokens to the index. The answer we’ve agreed to so far is “data-based” means the product itself is data vs a product that uses data to improve business/operations.

So for example Chainlink fits “data-based” because the nodes are selling data directly but Uniswap does not even though they used data to guide architecture to v3. Does that make sense?

@Thomas_Hepner is that an accurate paraphrase?

1 Like

Hi frens, :wave:
New poster; long-time fan of the Index Coop project.

I wanted to chime in and voice my strong support for the Data Economy Index. :muscle::white_check_mark:

We at ConsenSys Codefi work with a lot of new DeFi users both retail and increasingly institutional (see MetaMask Institutional). New DeFi users are well versed in the big crypto trends (DeFi, NFTs!, Data Economy, DAOs) but they don’t have the expertise to research hundreds of tokens and make the right plays. They want an INDEX!

The Data Economy / web3 / decentralized web has increasingly become a focus of our customers, VCs, the media, and even mainstream users (normies). The increasing number of data exploits, Big Tech’s monopolies, and DID momentum have all played into the Data Economy becoming a massive sector over the next few years.

I myself believe that web3 / Data Economy represents the next big advance in the internet, after mobile, and have worked closely with the Filecoin team to make this a reality over the last few years. I believe that DeFi will be the financial services layer for this internet evolution and have championed the creation of renFIL as a wrapped version of Filecoin and gotten it into Aave as a lending market. I’m happy to talk through the critiques of renFIL if needed but believe it’s the best wrapped FIL asset on Ethereum today.

Anyway, the DATA Index solves a real customer need that we’re seeing, and I believe will become super popular as the crypto data economy goes mainstream over the coming years. :white_check_mark::white_check_mark::white_check_mark:

Thanks for reading! :pray:


Just wanted to say I really think this should be our next index product for Index Coop. I have seen the commitment from the methodologists @Thomas_Hepner & @Kiba to get this index created in the community plus all the work that they have general done to improve the Index Coop as whole. I believe this could really move the needle for index coop in terms of AUM. I am looking forward to seeing this Index being a reality.


Hi @Thomas_Hepner and @Kiba

Sorry for the delay in getting back to you on this.

I must say I like the changes, it’s clear you’ve considered the feedback and thought about how to address it. :pray:

I like having more components as 4 certainly felt like it was too concentrated and I don’t see a problem with using renFIL - (There is a larger pool of liquidity behind that bridge and Uni v3 would have solved some of the problems we had with wDGLD in CGI).

Thank you.

One technical note:
Any deviation from pure market cap (liquidity factors/caps) actually results in more rebalancing being needed.

e.g. If LINK is 25% and underperforms the rest, then we will be buying it (and selling more of the others) at the next rebalance. Likewise, if LINK overperformed to become 30% then we will be selling Link and buying the rest.

[Pure market cap is sooo much simpler]

I’m not saying we should do away with the 25% cap on DATA (or DPI), just that there is an impact on the rebalance size - Rebalancing automation is something the EWG are working on (and Quarterly rebl;ances reduce the frequency of such work).


I had this exact idea just last night. Thank you for taking the time to work through all of the details. I think the $DATA index will be crucial.