Letting a team of AI agents manage my crypto portfolio

Timothy Engqvist Johansson

26 Aug 2025 — 11 min read

I'm the founder and creator of the AI agent platform Nelly, and while experimenting with a Crypto Analyst agent, I got curious about its real-world performance. So I decided to let a team of AI agents create and manage a crypto asset portfolio for me, actually three portfolios: one conservative, one balanced, and one aggressive. The agents will choose which crypto assets to research, analyse them one-by-one with a 6-month outlook, and then assemble portfolios that match each risk level based on those reports. I'll have the agents manage these portfolios monthly and track their performance against a few benchmarks.

Note: These aren't day-trading bots chasing patterns. They're investment agents that construct a portfolio from their own research and then monitor it periodically.

The AI agent platform

Nelly is a platform for creating AI assistants using natural language. Agents in Nelly are built by breaking a problem into specialized sub-agents, each with its own set of instructions. They run locally on your computer and can use tools such as a browser, a calculator, and a JavaScript runtime. The LLM used is Anthropic's Claude 4 Sonnet, and I used ChatGPT to help draft the specs for the agents in this article. I've published them to the Nelly Marketplace so you can see how they work, try them yourself, or even improve them and make your own versions.

Scouting for crypto assets

To analyze crypto assets, the agents first need a shortlist. For this I created the Crypto Token Scout agent, which compiles a diversified list of tokens based on constraints like "exclude stablecoins", minimum 24-hour volume, and more.

I then asked it for 20 good candidates:

Suggest 20 tokens that are good candidates to research further for inclusion into a 6-month portfolio.

And in under 10 minutes it came back with these 20 crypto tokens:

Symbol	Name	Network	Sectors	Score
BTC	Bitcoin	Bitcoin Network	Store of Value	0.846
ETH	Ethereum	Ethereum Network	Smart Contract Platform	0.812
SOL	Solana	Solana Network	Smart Contract Platform	0.792
XRP	XRP	XRP Ledger	Payments	0.776
LINK	Chainlink	Ethereum Network	Oracle/Infrastructure	0.724
ADA	Cardano	Cardano Network	Smart Contract Platform	0.720
AVAX	Avalanche	Avalanche Network	Smart Contract Platform	0.692
UNI	Uniswap	Ethereum Network	DeFi/DEX	0.676
ENA	Ethena	Ethereum Network	DeFi/Yield	0.676
ARB	Arbitrum	Arbitrum Network	Layer 2/Scaling	0.620
LTC	Litecoin	Litecoin Network	Payments	0.604
AAVE	Aave	Ethereum Network	DeFi/Lending	0.589
APT	Aptos	Aptos Network	Smart Contract Platform	0.580
XLM	Stellar	Stellar Network	Payments	0.572
HYPE	Hyperliquid	Hyperliquid Network	DeFi/DEX	0.557
NEAR	NEAR Protocol	NEAR Network	Smart Contract Platform	0.548
POL	Polygon	Polygon Network	Layer 2/Scaling	0.548
DOT	Polkadot	Polkadot Network	Interoperability	0.532
SHIB	Shiba Inu	Ethereum Network	Meme/Community	0.524
MNT	Mantle	Mantle Network	Layer 2/Scaling	0.512

Read the entire output with methodology here: https://timothyej.com/crypto-token-research-queue-20-selected-assets/

Reviewing the list

The list contains 20 tokens as requested and they all have a minimum 24h volume of $5M, are at least 30 days old, and listed on at least 3 exchanges. The diversity caps are also respected (max share per sector: 35%, max share per network: 40%) and the list has no stablecoins.

While the list is correct, the summary has a couple of errors:

⚠️ There are 6 Ethereum Network tokens, not 5.
⚠️ The "data fetched" timestamp should be 13 August 2025, not "2025-01-13"

Conclusion: These are minor mistakes and in this case they don't matter as the summary is not being used down stream.

Researching and analyzing the crypto assets

With a diverse list of crypto assets, I built the Crypto Analyst agent to research and analyze each token and output a comprehensive, decision-ready brief for each:

I had the agent analyze each candidate one-by-one using this prompt:

Analyze [Name]'s ([<Symbol]) performance and outlook for the next 6 months, including technical analysis and market sentiment.

On my machine it took about 15-30 minutes per asset. I saved all reports as Markdown files for the next step. Nelly agents can only run one task at a time, so I spread this over a couple of days (total: 6h 47m 21s). You can find all reports below, unedited (except for the main title):

Symbol	Name	Report	Time
BTC	Bitcoin	Bitcoin (BTC) Analysis Report - 6-Month Outlook	24m 8s
ETH	Ethereum	Ethereum (ETH) Analysis Report - 6-Month Outlook	14m 11s
SOL	Solana	Solana (SOL) Analysis Report - 6-Month Outlook	17m 56s
XRP	Ripple	Ripple (XRP) Analysis Report - 6-Month Outlook	13m 42s
LINK	Chainlink	Chainlink (LINK) Analysis Report - 6-Month Outlook	19m 41s
ADA	Cardano	Cardano (ADA) Analysis Report - 6-Month Outlook	23m 51s
AVAX	Avalanche	Avalanche (AVAX) Analysis Report - 6-Month Outlook	15m 59s
UNI	Uniswap	Uniswap (UNI) Analysis Report - 6-Month Outlook	17m 46s
ENA	Ethena	Ethena (ENA) Analysis Report - 6-Month Outlook	20m 45s
ARB	Arbitrum	Arbitrum (ARB) Analysis Report - 6-Month Outlook	17m 52s
LTC	Litecoin	Litecoin (LTC) Analysis Report - 6-Month Outlook	27m 10s
AAVE	Aave	Aave (AAVE) Analysis Report - 6-Month Outlook	18m 33s
APT	Aptos	Aptos (APT) Analysis Report - 6-Month Outlook	16m 4s
XLM	Stellar	Stellar (XLM) Analysis Report - 6-Month Outlook	24m 27s
HYPE	Hyperliquid	Hyperliquid (HYPE) Analysis Report - 6-Month Outlook	19m 52s
NEAR	NEAR Protocol	NEAR Protocol (NEAR) Analysis Report - 6-Month Outlook	23m 37s
POL	Polygon	Polygon (POL) Analysis Report - 6-Month Outlook	17m 32s
DOT	Polkadot	Polkadot (DOT) Analysis Report - 6-Month Outlook	26m 9s
SHIB	Shiba Inu	Shiba Inu (SHIB) Analysis Report - 6-Month Outlook	27m 30s
MNT	Mantle	Mantle (MNT) Analysis Report - 6-Month Outlook	20m 36s

Costs

To produce all of these reports cost me ~$130 in LLM costs.

Reviewing the reports

It would have been too much work to verify every report, so I picked 5 of them. I checked both manually and with the help of ChatGPT, and verified all of ChatGPT's claims before including them here. Treat this as a sanity check rather than a full audit.

Note: I did not fix any of the mistakes AI did as that would somewhat go against the idea of having AI manage my portfolios.

AAVE

"$35.61B in Total Value Locked (TVL)" ⚠️ Correct according to CoinMarketCap, but conflicting with Token Terminal ($58B).
"Active Users: +5.8% (94,500 unique addresses)" ⚠️ Seems to be +16.5% (https://tokenterminal.com/explorer/projects/aave/metrics/)

LTC

"Lightning Network: Planned integration for microtransactions" ⚠️ Litecoin already has Lightning Network.
"ETF Confidence: "All-time high" confidence from industry experts" ⚠️ Based on old news, Litecoin ETF decision delayed to October 2025.
"Sharpe Ratio: 2.35 (excellent risk-adjusted returns)" ⚠️ Looks like this was calculated based on simulated historical price data.

ETH

"significant 46.5% recovery from July's low of $2,942" ⚠️ July's low was even lower than that, near $2,400.
"Pectra Upgrade: Scheduled for May 2025, expected to enhance scalability and efficiency" ⚠️ Stale news.

SHIB

"Shibarium L2: Over 715M transactions processed, growing ecosystem" ⚠️ Now at over 1.5B transactions.
"Surpassed 1M holders milestone in July 2025" ⚠️ Seems like it surpassed 1M holders much earlier than July 2025.

LINK

"Rally Performance: +125% from $11.65 June low" ⚠️ That would a 116% increase, not 125%.
"vs Bitcoin: LINK +43.6% vs BTC +8.2% (30-day)" ⚠️ BTC was down almost 3% (30-day).
"vs Ethereum: LINK +43.6% vs ETH +12.8% (30-day)" ⚠️ ETH was up between 18-25% (30-day).
"LINK Reserve Program Launch: July 2025" ⚠️ This was announced on Aug 7.

Conclusion: Based on the reports I reviewed, the outputs are accurate enough for this experiment. However, I do have some ideas on how to improve their accuracy further, but that's for the next iteration.

Allocating the portfolios

With 20 candidates and fresh 6-month analyses, I built the Crypto Portfolio Allocator agent to construct an optimal portfolio for each risk profile:

It computes a risk-adjusted score for each asset from the reports, ranks them, and proposes a diversified allocation matched to the desired risk. The first portfolio took ~36 minutes (including parsing and scoring). The other two took ~5 minutes each since the parsing and scoring were already done. Here are the results:

Conservative

Ticker	Sector	Weight %	Amount (USD)	Notes
LINK	Oracle infrastructure	25.00%	$250.00	Top-ranked asset with strong fundamentals. Key risks: competitive pressure from Pyth/RedStone/Chronicle, centralization concerns, valuation risk after 125% rally
AVAX	Layer 1 blockchain	25.00%	$250.00	High-scoring L1 blockchain with solid fundamentals. Key risks: competitive pressure from Solana, validator concentration, regulatory uncertainty
BTC	L1 blockchain digital gold store of value	25.00%	$250.00	Digital gold standard with proven track record. Key risks: regulatory headline risk, mining pool concentration, exchange security vulnerabilities
LTC	Payment-focused cryptocurrency	25.00%	$250.00	Established payment-focused cryptocurrency. Key risks: high Bitcoin correlation (96%), mining centralization, ETF approval uncertainty

Balanced

Ticker	Sector	Weight %	Amount (USD)	Notes
LINK	Oracle infrastructure	18.92%	$189.23	Leading oracle network with strong fundamentals, though faces competitive pressure from emerging solutions
AVAX	Layer 1 blockchain	15.77%	$157.70	High-performance L1 with strong ecosystem, competitive positioning vs Solana
BTC	L1 blockchain digital gold store of value	15.14%	$151.39	Digital gold standard with institutional adoption, regulatory headline sensitivity
LTC	Payment-focused cryptocurrency	15.14%	$151.39	Established payment coin with ETF potential, high Bitcoin correlation (96%)
AAVE	DeFi	13.04%	$130.36	Leading DeFi lending protocol, governance and security considerations
APT	Layer-1 blockchain	11.48%	$114.80	Innovative L1 technology, token unlock schedule presents near-term pressure
ADA	L1 blockchain smart contract platform	10.51%	$105.13	Established smart contract platform, current valuation concerns (NVT 4.5x overvalued)

Aggressive

Ticker	Sector	Weight %	Amount (USD)	Notes
LINK	Oracle infrastructure	12.58%	$125.77	Strong oracle positioning despite competitive pressure from Pyth/RedStone; valuation risk after 125% rally
AVAX	Layer 1 blockchain	10.48%	$104.81	Solid L1 fundamentals but faces competitive pressure from Solana; validator concentration concerns
BTC	L1 blockchain digital gold store of value	10.06%	$100.61	Digital gold store of value with regulatory headline risk; mining pool concentration concerns
LTC	Payment-focused cryptocurrency	10.06%	$100.61	Payment focus with high Bitcoin correlation (96%); ETF approval uncertainty
AAVE	DeFi	8.66%	$86.64	Leading DeFi protocol with governance centralization and security concerns; high valuation multiple
APT	Layer-1 blockchain	7.63%	$76.30	Emerging L1 with token unlock risk ($54M scheduled); technical breakdown risk below $4.14
ADA	L1 blockchain smart contract platform	6.99%	$69.87	Smart contract platform with valuation risk (NVT 4.5x overvalued); governance controversies
DOT	Layer-0 protocol / Interoperability / Infrastructure	6.71%	$67.08	Interoperability focus with regulatory uncertainty; limited DeFi ecosystem vs competitors
ETH	L1 blockchain DeFi infrastructure smart contract platform	6.71%	$67.08	DeFi infrastructure leader with MACD bearish divergence; validator concentration risks
MNT	Layer 2 scaling solution	6.71%	$67.08	L2 scaling solution with centralization concerns; smart contract vulnerabilities
NEAR	L1 blockchain	6.71%	$67.08	AI ecosystem play with extreme P/S ratio (984x); high volatility (85.6%)
POL	Layer-2 scaling solution DeFi infrastructure	6.71%	$67.08	L2 DeFi infrastructure with network outage risks; intense L2 competition

Costs

These are the LLM costs for allocating the portfolios:

Conservative: $6.4 (includes report parsing and asset scoring)
Balanced: $0.6
Aggressive: $0.6

Reviewing the allocations

All the weights and amounts sum up correctly and the agent allocated the portfolios using the best-scoring assets. I reviewed the scores for the assets in the Conservative portfolio and the only possible error I found was that it included low liquidity / thin markets (+0.4) in the Risk Index for AVAX resulting in a lower Risk Adjusted Score (0.375 instead of 0.450).

Tracking performance

I'm tracking each portfolio against four benchmarks: BTC, ETH, and two crypto indices: NCI and COIN50. NCI is Nasdaq's Crypto Index, and COIN50 is Coinbase's index consisting of the top 50 leading crypto assets. The indices allow me to track whether my portfolios are performing better, worse, or the same as the crypto market overall. And comparing against BTC and ETH will tell me whether diversifying will have a better return than just holding BTC or ETH.

I'll update the performance weekly here:

Next steps: Monitoring and Rebalancing

This isn't about day trading, so I don't want the agents making daily trades. The agent did recommend watching for stop-loss levels, major events, and other things that could justify rethinking positions. It also recommended monthly rebalancing, so I'm going to create a Rebalancing agent and perhaps a Monitoring agent to handle this.

Follow the progress

I'll keep posting updates on performance and on new agents I build for rebalancing and monitoring. Follow along here or on X to stay up to date. To the moon! 🚀

Want to comment? Leave a comment on the post on X.

⚠️

This post is for educational and informational purposes only. It is not investment, legal, or tax advice. The agents and outputs described here are experimental. They may rely on third-party data that can be incomplete, outdated, or wrong. Large language models can produce errors or hallucinations, and models, prompts, or tools may change over time. I make no representations or warranties about accuracy, completeness, or timeliness. Crypto assets are highly volatile and you can lose all of your capital. Do your own research and consult a qualified adviser.