eupolicy.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
This Mastodon server is a friendly and respectful discussion space for people working in areas related to EU policy. When you request to create an account, please tell us something about you.

Server stats:

216
active users

#alignment

0 posts0 participants0 posts today

Current techniques for #AI #safety and #alignment are fragile, and often fail

This paper proposed something deeper: giving the AI model a theory of mind, empathy, and kindness

The paper doesn't have any evidence; it's really just an hypothesis

I'm a bit doubtful that anthropomorphizing like this is really useful, but certainly it would be helpful if we were able to get more safety at a deeper level

If only Asimov's Laws were something we could actually implement!

arxiv.org/abs/2411.04127

arXiv.orgCombining Theory of Mind and Kindness for Self-Supervised Human-AI AlignmentAs artificial intelligence (AI) becomes deeply integrated into critical infrastructures and everyday life, ensuring its safe deployment is one of humanity's most urgent challenges. Current AI models prioritize task optimization over safety, leading to risks of unintended harm. These risks are difficult to address due to the competing interests of governments, businesses, and advocacy groups, all of which have different priorities in the AI race. Current alignment methods, such as reinforcement learning from human feedback (RLHF), focus on extrinsic behaviors without instilling a genuine understanding of human values. These models are vulnerable to manipulation and lack the social intelligence necessary to infer the mental states and intentions of others, raising concerns about their ability to safely and responsibly make important decisions in complex and novel situations. Furthermore, the divergence between extrinsic and intrinsic motivations in AI introduces the risk of deceptive or harmful behaviors, particularly as systems become more autonomous and intelligent. We propose a novel human-inspired approach which aims to address these various concerns and help align competing objectives.
Replied in thread

@Nonilex

👉The #DumbingOfAmerica: The #StultificationOfThePeople👈 1)

(1/2)

After #Reagan successfully started with the dismantling of higher education for the not-well-to-do as part of #Reagonomics 2), the extremist part of #Republicans called #AmericaFirst in the 1930's and 40's, and now #MAGA are now going a step further by axing primary/2ndary ed., and the #Alignment (#Gleichschaltung) 3) of the #Education system through #MAGA-controlled state bodies.

#TheStultificationOfAmerica
The...

What is alignment?

Does alignment imply ignoring the reality of harm through toxic positivity? No.

Alignment:

- Acknowledges the reality of destructive agents, parts of the systems that don't work, and their impacts, while
- Focusing intention and attention on the presence of constructive agents, parts of the system that do work.

#ChangeMakers #alignment

1/3

Good Idea: Corporation Alignment

punyamishra.com/2025/01/05/cor

Just like we worry about AI systems being programmed with goals that might lead to unintended harm, we should also think about how corporations are “programmed” to prioritize profit above everything else. When a business is only focused on making money, it can end up causing damage—whether that's exploiting workers, harming the environment, or ignoring the needs of society.

Joseph Jaworski speaks of the ability to sense and seize opportunities as they arise:

"You have to pay attention to where that opportunity may arise that goes clunk with what your deeper intention tells you to do. When that happens, then you act in an instant. Then I operate from my highest self, which allows me to take risks that I normally would not have taken."

As a change maker, this is an essential skill to cultivate.

#ChangeMakers #alignment

1/3

Continued thread

Third: We eliminate the BS of traditional OKR "cascading."

Cascading OKRs down through the levels of an organization to the individual may sound good in theory, and sells a lot of OKR software, but in reality it burns a ton of time, effort, and overhead (and usually hurts the quality of the methodology adoption instead of helping it).

Instead, we focus on creating "best practice" OKRs at the top two levels of the org, then embrace a flexible bottoms-up approach from there.

Replied in thread

#USpol #Fascism
@pojntfx
#TikTok"s CEO will be at the inauguration of the first convicted-fellon president-elect, jointly with the fascist #TESCREAL #TechOligarch, #Elmo.

As you know from your history classes, what is the most important strategy in the rise of fascism?--The #Alignment of society (#Gleichschaltung.)

With #SuckerBerg having joined the #MAGA fray this week, the unholy alliance of them and the #fascist #Longtermist #Musk #TechBros, #TikTok is priceless for them.

@stavpup

Elevation-Derived Hydrography [EDH] - The USGS’s Rich New Hydrological Features Dataset
--
doi.org/10.2489/jswc.2024.0314 <-- shared paper
--
pubs.usgs.gov/publication/tm11 <-- USGS EDH Representation, Extraction, Attribution, and Delineation Rules reference publication
--
usgs.gov/3d-hydrography-progra <-- shared link to the USGS 3DHP page
--
[in my role, I have the pleasure of working with the valuable EDH process(es) and the data it produces on a daily basis]
#GIS #spatial #mapping #water #hydrology #hydrography #3dep #edh #3dhp #elevationderivedhydrography #opendata #elevation #dem #dtm #interpretation #waterfeatures #usecase #waterresources #floodmodeling #alignment #model #modeling #dataset #naturalresources #costs #benefits #economics #businessuse #publicdata #spatialanalysis #USA #USGS
@USGS

When taking into account #visual #communication,
the online debate around #climate #change is characterized by the presence of #echo #chambers
with only a small fraction of content circulating among both climate activists and climate skeptics.

In cases where the same visual content circulates within the two groups,
the emotional reactions are often opposed,
with reactions that are more defined by the pre-existing climate #ideological #alignment
than with an actual engagement with the content.
tandfonline.com/doi/full/10.10