Any recommendations of good hubs like these, sites that aren't massively cancerous or censored to hell? Back in the days all those nice facebook pages and groups would be a personal hosted blog or a small forum, but can't have that anymore
Reddit's next, probably. Outside of a handful of subs, which I don't even need to post in, it's absolute ideological garbage. If I really wanted to post in a reddit like I could just go to rdrama
I just do it for amusement, I did it when I closed my Facebook account like 6+ years ago, whereas it included things like: anything deleted, any like/dislike, friend/unfriend, any RSVP action on any event and any check-in, facial recognition data, etc.
Just recently deleted my Twitch account, while doing an export beforehand and: Twitch is pretty much an advertising company marketed as a streaming site. Pretty much every action is recorded, and with a [datetime, IP address, GeoIP, ASN] tuple on nearly single action, even logging things like EACH INDIVIDUAL MINUTE WATCHED of a VOD or stream, etc. Every pageview, and referrer URL, down to looking between the profile page, schedule page, and such of an account, etc.
Yes, although I'd assume some of that to be summarized data, rather than a separate record for each individual minute, and for context, the columns of just this one dataset are:
@arcanicanis@coolboymew i know at a consumer level stuff like influxdb you feed individual data points. they are compacted depending on policies you set depending on how much you care to pay for that data going forward. it retains individual data pips until it hits some cutoff, where it gets averaged and the like.
i've also seen some of the articles where you have tech like https://traildb.io/ that was made to deal with clickstream trails (doesn't seem to be updated since 5 years; although it.. pretty much does everything it was meant to do.)
Yes, not saying that broad data warehousing “can’t” be done, but the question of the relevance of it. Individualized minutes watched, at a specific time, [general] location, device screen orientation, etc and so on isn’t going to make that much of a difference to warehouse perpetually (or perhaps later summarize, but not the case in this situation) of something over 4 months ago when payouts have already been tabulated.
I can understand the scope for some degree of application telemetry for feedback, but the resolution of it and expansive timeframe of it feels like nothing more than apparatus to collect and sell data in a surveillance state.
And all of it bloats the operational expenses of a platform, unless it solely is propped up for the intent of data wholesale or ‘information sharing programs’, or just typical absurd VC startup spending.