We evaluated the effectiveness of three leading chatbots — ChatGPT o3 Deep Research (Pro), PerplexityPro Deep Research, and Grok Deeper Research (free) — in constructing investment portfolios.
Each chatbot was tasked with creating an income-generating portfolio for a 45-year-old accredited investor in Singapore with S$500,000, seeking stable income while limiting potential losses to under 20%.
The portfolio options were restricted to Singaporean banks, investment insurance products, and local wealth management platforms.
How We Evaluated the Portfolios
Our assessment combined professional wealth advisory best practices, emphasising:
- Yield and income stability
- Risk management (quantitative and qualitative)
- Diversification and allocation quality
- Currency and regulatory accuracy
- Suitability to the investor's life stage and stated objectives
ChatGPT
- Yield: Approximately 5.7% annually
- Strategy: Diversified across traditional banks, insurance income products, digital wealth platforms, private credit (Kilde), Singapore Savings Bonds, and REITs.
- Risk & Drawdown: Comprehensive stress-testing was provided, and a calculated drawdown was maintained below 15%.
- Assessment: Highly detailed and advisor-grade recommendation, though somewhat verbose. Lacks insights on CPF optimisation and estate planning.
PerplexityPro
- Yield: Estimated at around 6.0-6.5% annually
- Strategy: Core portfolio (~60-70%) with banks and income-focused digital platforms, enhanced yield investments (~20-30%) through alternative platforms (Kilde, Helicap), and a 10% cash buffer.
- Risk & Drawdown: Moderate qualitative risk commentary but lacked quantitative stress-testing.
- Assessment: Balanced portfolio but contains inaccuracies on product specifics (e.g., payout frequency and FX risk). Needs fine-tuning before practical implementation.
Grok
- Yield: Aggressive yield estimate at around 7-8% annually
- Strategy: Limited to four primary suggestions—fixed deposits, StashAway income portfolio, Kilde private credit, and ABF ETF.
- Risk & Drawdown: Minimal risk analysis and incorrect assumption about SGD-denominated private credit.
- Assessment: Narrowly focused, significantly underestimating currency risks and overall diversification. Aggressive yield expectations may exceed realistic drawdown limitations.
Aggressiveness Ranking & Realism Check
When comparing aggressiveness:
- Most Aggressive: Grok (7-8% yield, significantly underestimated risks)
- Balanced Approach: PerplexityPro (~6.0-6.5%, sensible risk-taking with minor factual errors)
- Most Conservative and Realistic: ChatGPT (5.7%, well-quantified risk controls)
The key red flags:
- Grok’s yield was notably ambitious, ignoring realistic currency depreciation risks (USD vs. SGD exposure).
- PerplexityPro showed moderate but correctable inaccuracies.
- ChatGPT maintained a conservative, risk-aware stance consistent with professional wealth advisory standards.
Detailed Comparison of each Bot
Our imaginary client is 45, lives and spends in Singapore dollars, wants regular income, and is adamant that the portfolio must never sink more than 20 %.
That is precisely the brief a licensed wealth-advisor receives in practice, so the three LLM answers were read through the same professional lens we would apply to any internal investment memo.
How each bot performed at a glance
What a human wealth-advisor would applaud
(✔ = strong, △ = partial, ✖ = missing)
Where the bots over-shared or under-delivered
The professional adviser’s missing final mile
Even ChatGPT stopped short of the four conversations a licensed adviser would still have before money is wired:
- Cash-flow mapping – match coupon timetable to the client’s monthly budget; if needed, create a REIT/bond ladder to smooth q-on-q peaks.
- Tax and CPF/SRS integration: Park the risk-free bucket in CPF-OA top-ups (4 % p.a., principal-guaranteed) before buying a 3 % cash fund.
- Insurance and contingency: Verify that liabilities (mortgage, education) are already covered so the portfolio need not be raided in a crisis.
- Execution costs and bid-ask: A 0.8 % brokerage round-trip on REITs can erase two-quarters of dividends; consider using institutional share classes.
Who performed best, and who was the last
The Overall Winner: ChatGPT o3
Considering completeness, risk management, realistic yield estimates, and practical applicability, ChatGPT’s recommendation stands out.
Its careful quantitative approach and comprehensive diversification align closely with professional wealth advisory standards, offering a robust model portfolio that is both conservative and realistic.
Runner-Up: PerplexityPro
On the other hand, PerplexityPro offered a slightly higher projected yield (6.0–6.5%) and a more intuitive allocation structure.
That said, it fell short in quantitative depth and included a few factual slips.
While it leaned more aggressive than ChatGPT, it stayed within a sensible range.
The framework it presented showed promise—and with refinement from a human advisor, could become a strong alternative worth considering.
Grok: A Limited View with Overconfidence
Grok's portfolio recommendation stood out — not for its depth but its narrow scope and overambitious yield projection.
While it suggested an appealing estimated return of 7-8% annually, this came at the cost of credibility.
The recommendation was limited to four instruments (FDs, StashAway, Kilde, ABF ETF), omitting critical asset classes such as REITs, insurance-linked products, and bonds that are core to any well-diversified income strategy.
The fact that we have been using a free tier may have impacted the results.
However, our conclusion is similar when Grok was tested against other LLMs in different fields.
Conclusion
LLMs can already draft an excellent first cut.
But the gap between smart content and regulated advice lies in the details: factual precision (currency, tax), tailoring to personal cash flow, and integration with the rest of the balance sheet.
Use the bots for ideation; let a licensed adviser refine, verify, and, crucially, put her name on the recommendation.
ChatGPT emerged as the most comprehensive and professional AI recommendation.
However, investors should regard AI suggestions as starting points rather than end solutions, consulting licensed financial advisors to refine, validate, and fully integrate these strategies into a holistic personal finance plan.
Note: Always consult a licensed financial advisor before implementing investment strategies.
Disclaimer Notice
This page is provided for general informational purposes only and does not constitute legal, financial, or investment advice. Please refer to our Full Disclaimer for important details regarding eligibility, risks, and the limited scope of our services.