add social-signal-scraping skill#118
Conversation
|
Important Review skippedAuto reviews are disabled on this repository. Please check the settings in the CodeRabbit UI or the ⚙️ Run configurationConfiguration used: Repository UI Review profile: CHILL Plan: Pro Run ID: You can disable this status message by setting the Use the checkbox below for a quick retry:
✨ Finishing Touches🧪 Generate unit tests (beta)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
simantak-dabhade
left a comment
There was a problem hiding this comment.
Hey @urugondavishnu! Great concept, two things to clean up:
1. Remove the keyword-based sentiment classification
The skill includes a manual keyword scoring system (positive/negative word lists, ratio scoring, etc.) — but this is a skill running inside Claude, which is an LLM. Claude can assess sentiment directly from the content without any of this. Hardcoding word lists and scoring formulas is overengineering the wrong layer. Just instruct Claude to classify sentiment as part of the synthesis step and it'll do a far better job than keyword matching ever could.
2. Remove the TypeScript implementation reference
The bottom half of the SKILL.md contains a full TypeScript code scaffold — runTinyFish(), scrapeReddit(), aggregateResults(), etc. A skill isn't a codebase — it's instructions for an AI agent. Claude doesn't need application code to execute the skill, it needs clear natural language instructions on what to do and in what order. This TypeScript block belongs in a separate app project, not in a SKILL.md.
Think of the skill purely as: what should Claude do, step by step, using the TinyFish CLI. Everything else is noise that inflates the skill size without helping the agent.
|
Thanks for the feedback @simantak-dabhade ! I’ve made the requested changes:
The skill is now purely instruction-based and focuses only on what Claude should do at each step. LMK if you'd like any further tweaks. |
…producthunt scrolling, limit to 10 results per platform
simantak-dabhade
left a comment
There was a problem hiding this comment.
Hey! Thanks for the submission — the SKILL.md is well-structured and the concept is solid. A few things need to be addressed before we can merge:
Critical:
-
Nitter is dead — The skill routes Twitter/X scraping through
nitter.net, but Nitter shut down in mid-2023 and most mirrors are unreliable. This leg of the skill won't work. Please either:- Use
x.com/searchdirectly (accept that it may need login/stealth) - Or remove Twitter as a platform and document the limitation
- Use
-
Product Hunt via Google
site:search is fragile — Searching Google forsite:producthunt.comthen clicking into results is a multi-hop chain that'll hit CAPTCHAs and rate limits. Use Product Hunt's own search directly:producthunt.com/search?q=<query>
Minor:
- Add a
.env.examplewithTINYFISH_API_KEY=— repo convention - The
--asyncflag usage referencestinyfish agent run get <run_id>for polling but doesn't show the actual polling syntax. Please clarify the polling loop or command. - Sentiment is classified twice (once during extraction in Step 1, again in Step 3). Pick one and remove the redundancy.
Once the Nitter and Product Hunt issues are fixed, this is close to ready!
social-signal-scraping
Scrape social signals (Reddit, Twitter/X, Product Hunt) about any company, product, or topic using TinyFish. Returns structured JSON with sentiment analysis, pain points, positive feedback, and trending topics.