Early update on Arcadia publishing 2.0: Scientists are in charge, speed is an issue
About six months ago, we switched to a new “version 2.0” or “v2” model for our open publishing approach. We’d strayed a bit too far from our vision of researchers rapidly sharing their science on their own terms, and we needed to remove some top-down control over timelines, content, and process. Instead, our dedicated publishing staff (“pub team”) offers support on an opt-in basis.
This pub presents the initial results from this stage of our experiment. We’ve eliminated a lot of bureaucracy, and our scientists are now fully in charge. The pub team has more time to write meta-reflection pubs (like this!), improve publishing infrastructure and build tools, help more deeply on pubs that need it, and better integrate publishing with hiring. The main problem so far is that we’re sharing less often. Without close oversight, not all of our scientists share promptly. We’ve also shifted a lot of responsibility to managers, who sometimes struggle to keep up with writing their individual pubs. Last, we haven’t seen a change in the utility of engaging with the broader community — our scientists don’t interact much externally on their own, and outside researchers seem to find and interact with our work a bit less often. Overall, though, we think these downsides are manageable, and we have plans to address them.
We based this pub on early data, after completing just 11 pubs via our v2 approach. We’ve since updated this piece to add a new section after releasing 18 more pubs. Most of our early indicators held true, and having more data hasn’t changed our overall conclusions. The biggest difference is that, excitingly, our second wave of v2 pubs was significantly quicker. Taken as a whole, v2 pubs are still ever so slightly slower than v1 pubs, but if the current trend continues, v2 will end up being faster.
We hope anyone following our publishing experiment will appreciate the update and give us feedback! We’d love to know what you think and hear any ideas for overcoming the issues we raise.
Feel free to provide feedback by commenting in the box at the bottom of this page or by posting about this work on social media. Please make all feedback public so other readers can benefit from the discussion.
We’ve been publishing work on this platform, independently of journals, since mid-2022. Two years into our experiment, we reflected on an internal challenge that had slowly emerged — the deterioration of our scientists’ agency in the publishing process — and decided to rectify it [1]. We refer to our original approach as “version 1.0” and our new strategy as “version 2.0” or “v2.”
We briefly summarize the key differences in our new v2 framework below, but check out our full pub on this evolution for more complete context.
In a nutshell, we realized that — though well-intentioned — our pub process had grown to be too complex and bureaucratic [1]. We required multiple meetings to plan and check in on pubs, and every draft had to go through review by the publishing team. In v2, we’ve shifted to a scientist-led system with just two requirements: byline contributor approval and availability of the materials/information needed to reproduce the work. By default, our scientists now decide their own publishing formats, timelines, and engagement approaches without input from the pub team. Our pub team still provides support services, but only upon request from scientists, who choose from a menu of options.
We hoped this streamlined approach would better serve our goals of rapid, impactful scientific sharing while maintaining our commitment to openness and reproducibility. We figured the potential risks would be variation in quality, poor decision-making about commercialization by individual scientists, and lower pub output as a whole.
In this pub, we check how things are going with our v2 model. Read on to learn about the impacts we’ve seen so far!
We’re about six months in, and we’ve applied our new v2 publishing process to 19 total pubs (10 released, one we’re keeping internal for now, and eight in progress). While we’ve been self-assessing progress constantly along the way, we thought now would be a good time to take a more comprehensive snapshot of where things stand so we can reflect and update our readers.
We’ve mapped out our progress in Figure 1, manually approximating the relative size of each change we’ve observed to provide an overall sense of how things have evolved. In the subsequent subsections, we discuss what’s going well and how we need to improve in more detail.
Summary of how things are going with our v2 publishing model compared to v1 as the baseline.
We’ve assigned rough ratings for how much various aspects of our publishing approach have changed in this new version of our model relative to our prior version. We group each readout based on how it connects to the big-picture goals of our publishing experiment — internal experience, speed, utility, and replicability. Keep in mind that these are approximations and only expressed relative to the original model.
We’ve implemented a suite of changes that automatically streamline our publishing process and shift agency back to our scientists.
Things are much more efficient. We’ve eliminated two hours of required meetings for each pub and three hours of check-in meetings between the pub team and scientific team leads each month. We still do occasional ad-hoc meetings about particular pubs or sets of related pubs, but the number of these hasn’t changed in our new approach. Unnecessary meetings are a huge opportunity cost for the company due to the people-time involved, so this reduction is a win for us.
Scientists now choose what help they get from the publishing team. Our standard v1 publishing process included about seven discrete tasks performed by members of the pub team for every pub (e.g., project management, editing the first draft, cleaning up figures to match our style, etc.). We’ve turned everything we used to do into an expanded menu of optional pub team “services.” Each pub has unique needs, and our scientists can now choose what they want and skip what they don’t. Thus far, each v2 pub has received around three services. One pub got seven services and another received none at all! Our scientists seem to have an intuitive understanding of what will help them most, and this can vary quite a bit from pub to pub (Table 1 shows our most-requested services).
| Service | Description |
| --- | --- |
| Figure cleanup | Beautify figures, implementing style/branding guidelines |
| Thorough editing | Copyediting and in-depth feedback on content, structure, visuals, and overall effectiveness |
| Quick read-through for flow and clarity | Identify if any aspects are particularly confusing, suggest structural changes, provide feedback on visuals, and flag any key scientific details that seem to be missing |
| Original figures | Visualize methods, abstract concepts, organisms, processes, etc. |
| Technical consultation | Discuss what’s possible on PubPub and beyond, explore technical feasibility of creative ideas |
| First-time pub onboarding | Meet-and-greet with pub team, overview of publishing at Arcadia and services offered, etc. |
| Visual strategy session | Talk through how visuals can support text (including data visualization, illustrations, graphical abstracts, and number/order of figures) |
Publishing services that our scientists have thus far requested at least twice.
Listed by number of requests, with most at the top.
In addition to being more empowered, our scientists also report an improved publishing experience. After we release each pub, we send an optional survey to the researchers who were involved. Scientists who’ve published through both our original and new models report that their latest experience was the same or better. That said, we’ve only released a few pubs exclusively through the v2 system so far and don’t have many survey respondents (n = 6), so this is something we’ll have to monitor over time.
One concern about our new system was that, with less oversight from our pub team, we’d start releasing pubs with more errors, confusing writing, or other issues. While “quality” is a difficult property to measure, we have a couple of ways to track it, and we haven’t noticed any major changes. First, we include a form at the end of each pub asking readers how clear, useful, and replicable it is. Based on the 214 responses we got for pubs released via our v1 process, each pub was rated “clear” about 85% of the time (as opposed to “a little confusing” or “confusing”). We have only 20 responses across our v2 pubs so far, but 83% of those responses called them “clear.” Responses to the other questions are also largely similar across v1 and v2 pubs. Second, we calculate a measure of readability (the “Flesch reading ease,” or FRE, where higher numbers are easier to read; for reference, U.S. newspapers tend to have FREs around 45 to 50 [2]) for each pub using the scireadability Python package (v0.6.4) [3]. The average FRE is about the same for pubs developed via either our original or new model (~31 for v1 pubs; ~29 for v2), corresponding roughly to U.S. school levels of “college” and “college graduate,” respectively. Accurate syllable counting for FRE is challenging with heuristics, but we believe these scores are reasonably accurate, and we’ll continue encouraging our scientists to write in a clear, conversational style. We also don’t see a strong correlation between these readings and whether or not someone used pub team services. Again, we don’t have a lot of data yet and “quality” is tough to define, but we haven’t seen a big qualitative difference either. We’ll keep tracking this.
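For readers who want to sanity-check FRE on their own text, here’s a minimal sketch of the standard Flesch reading-ease formula with a naive vowel-group syllable counter. The scireadability package we actually use handles syllable counting far more carefully, so treat this as an illustration rather than our exact pipeline.

```python
import re

def naive_syllables(word):
    # Rough heuristic: count groups of consecutive vowels (scireadability
    # uses a more careful, dictionary-backed counter).
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))

def flesch_reading_ease(text):
    # FRE = 206.835 - 1.015 * (words per sentence) - 84.6 * (syllables per word)
    sentences = max(1, len(re.findall(r"[.!?]+", text)))
    words = re.findall(r"[A-Za-z']+", text)
    n_words = max(1, len(words))
    syllables = sum(naive_syllables(w) for w in words)
    return 206.835 - 1.015 * (n_words / sentences) - 84.6 * (syllables / n_words)

print(round(flesch_reading_ease("Shorter sentences and simpler words raise the score."), 1))
```

Shorter sentences and fewer syllables per word push the score up, which is why a conversational style tends to read as more accessible under this metric.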
How have we maintained this consistent quality? Well, our scientists are pretty good communicators already. And in cases where a concept is tricky to articulate well or a writer is less experienced, they do a great job of getting the help they need from their manager, a collaborator, pub team, or a combo. With more free time, the pub team is even more available to help with pubs that need extra attention.
We’ve also implemented writing evaluations as part of our hiring process. We try to hire applicants who are both philosophically aligned with our open approach to sharing and able to demonstrate their ability to communicate science clearly. It’s okay not to be a brilliant writer with perfect grammar, but the pub process is much easier for scientists who’ve mastered the fundamentals. We’ve also noticed that clarity of writing often correlates with clarity of thinking about complex topics, so it’s a helpful evaluation point even outside of the publishing context.
Last, we started using Grammarly company-wide. In addition to ensuring basic grammatical correctness and improving the structure/flow of sentences, the business-level plan lets us create custom style rules to prevent common errors specific to science writing and to ensure stylistic consistency. This replaced a huge proportion of the manual copyediting we used to do, and our scientists report other positive benefits, like making their writing more conversational, taking the second-guessing out of their process, and improving their writing quality.
We plan to experiment with AI-based writing solutions in the future, hoping not just to maintain quality, but to improve it further.
Removing pub team tasks and meetings from our pub process means there’s less of a built-in burden on our team’s time. There are some clear, immediate benefits to our scientists — for example, the wait time for services is shorter, so the pub team bottlenecks pubs less often. We also have more time to invest in pubs that need close attention and help.
Having time freed up has let us reallocate it to bigger-picture pursuits that enhance our overall publishing experiment. Our publishing director has more time to help write “meta” pubs reflecting on our progress as a company (like this one!) and better incorporate writing evaluation into our hiring process. Our project manager has more time to quantitatively track the success of our experiment and build internal publishing infrastructure for our scientists (e.g., pub stats dashboards, automatic social media amplification of pubs, tools for finding relevant research, etc.). We’ve created guides on writing, accessibility, choosing the right pub type, checking first drafts for common errors, and more. We’ve also had time to implement new tools like Grammarly with extensive customized style rules and to collaboratively develop software packages to render figures in our house style [4][5].
This makes our publishing model increasingly replicable. Now that it doesn’t require a full support team, it’s much easier to imagine a lab or a small startup trying our approach. Once we migrate to the upcoming new version of PubPub, called “PubPub Platform,” we plan to share templates and many of the resources we’ve recently created to help others adapt our model.
We previously had an “IP check” as part of our publishing process, and this arrangement occasionally caused delays in sharing content. While legal review of pub content only took a few days at most, punting decision-making about commercialization and patentability to the end of the scientific process could occasionally cause longer delays of several weeks because essential conversations were happening too late.
In our v2 model, we’ve removed the legal check and let our scientists decide what they share and when they share it. They’re free to release pubs whenever they’re ready, but also carry more individual responsibility to think critically about how we’ll ultimately apply our research in the real world. This model helps encourage more forethought about downstream commercialization potential, during even the earliest phases of a new project.
Extricating commercialization-related strategy from publishing coincides with a similar shift elsewhere in the company. In a recent internal audit, our translation team found that rushing into pilots without planning around ideal outcomes or suitability for downstream therapeutic development was a common, preventable failure mode during their initial efforts to establish Arcadia’s startup studio. Company-wide, we’re now much more aligned on having scientists understand what it'll take to spin out a company and frontloading rigorous commercial research into their project plans.
Hearteningly, commercialization discussions haven’t delayed the release of any v2 pubs. And it isn’t just because we’re sharing all of our work without thinking it through — we’ve completed a few pubs that our scientists expediently chose to keep internal until they finish the experiments necessary to understand whether their ideas have therapeutic potential. Our v2 shift and improved recordkeeping also led us to revisit a pub we decided to hold back during the v1 period, and we realized that we should make it public. So overall, the adjustments in our practices around commercialization seem to result in a default of openness, greater efficiency in sharing, and more thoughtful discussions on the key results we shouldn’t share (yet).
One potential downside of this approach is that we’ll accidentally release information that prevents us from patenting or commercializing a discovery. This doesn’t seem to have happened but will likely only become apparent in hindsight. We’ll continue looking out for such occurrences.
We have many ideas to address these problems, but it would extend the pub quite a bit to articulate them all. We’d love to hear any suggestions from you!
Let’s address the biggest issue with our new system first. It’s critical to our experiment that we share efficiently and openly, at a pace that ensures our findings can benefit others quickly and that any feedback or ideas we receive are still actionable. The most worrisome trend in our v2 pubs is that they’re taking significantly longer to complete. We previously took about 59 workdays to craft a pub from start to finish (mean, n = 55, standard deviation = 33). Pubs developed through our v2 process have taken 71 workdays (mean, n = 11, SD = 37). We also have eight draft pubs at the moment, and as of this pub’s release, they’ve already been in progress for a mean of 67 workdays. This is far too slow, at least for us (see below).
How long should a pub take?
It’s important to note that our pubs take much less time to write than traditional research papers. Reliable stats are hard to find and this varies dramatically from author to author, but it can take anywhere from a few weeks to many months to write a typical manuscript [6], let alone publish it. By comparison, 71 workdays from typing the first word to seeing the work live online is a dramatically shorter end-to-end timeline.
For us, though, when a pub takes a “long time” to complete, it often means the point person isn’t breaking their research down into modular “chunks” that they can share quickly, or it could indicate a scientific/performance issue. Delays also have major side effects: the process becomes inefficient with fits and starts that slow down other work, the contributors carry the nagging emotional burden of an unfinished task hanging over their heads, the external community misses out on timely information, and we can’t make use of feedback if we’ve already moved onto other projects by the time we release and get critical comments.
We’ve seen how quickly a pub can come together when there’s an external pressure, all without anyone working extra hours or feeling undue stress. So we don’t want to let ourselves slide into a norm where pubs are unnecessarily slow. Based on our experiences so far, we’d say 20–30 workdays is fast for a data-centric pub, and 35–50 is a reasonable goal. Anything beyond 50 workdays is likely counterproductive.
Why are pubs taking so long? It’s probably because the publishing team no longer provides automatic project management. Our team used to sit down with scientists and map out a plan and timeline for each pub. If someone fell behind, we’d remind them and guide them back on track. We’d also check in monthly with team leads to get a sense of all upcoming potential pubs and make sure they were kicked off in a timely manner. Without this close oversight, it’s much easier for scientists to procrastinate, and the longer a pub drags on, the tougher it usually is to finish.
We may actually be underestimating how long pubs are taking, too. For the pub team, the most challenging part of our v2 model is a lack of visibility. We can’t tell when scientists have started writing a new pub unless they tell us or fill out our “pub kickoff” form. This happens most — but not all — of the time without prompting. We have almost no way of telling when new research could be shared, but the lead scientist, for whatever reason, isn’t writing it up. This limits our ability to track the success of our publishing experiment as a whole and makes intervention more difficult.
Many of the mechanisms we’ve developed for accountability depend on reliable tracking. For example, we have a “slow pub alert” set up for our CSO, a dashboard on a TV monitor in our headquarters that lists how long each pub has been in progress, and so on. And though we no longer do automatic project management, we still provide occasional reminders. That said, none of these strategies are useful when we don’t know that someone is writing (or should be writing) a pub.
A few of our slowest pubs have lingered since before we switched to our v2 system. We count them as “v2 pubs” because they’ve been going through the new process, but in theory, these could have been written much earlier. Thus, some apparent overall slowness may stem from legacy pubs that suffer from problems independent of our publishing model. We hope that new pubs will move more expediently once we clear the backlog, but there are still changes we should make to speed things up.
Another factor contributing to this slow-down is that managers bear more responsibility in our v2 model, which can cause work overload.
In our prior system, the pub team typically edited the first draft of each pub and its figures before other contributors reviewed the product. This could be slow, especially for less-polished pubs, because the pub team would spend a great deal of time puzzling through confusing information or seeking better ways to visualize data. In our new system, managers work with their reports to get first drafts into good shape before involving the pub team. Qualitatively, this appears to make the overall pub process more efficient because managers are most informed about what’s not accurate or not well-described in early pub drafts and have the domain expertise to know immediately what to suggest instead.
The flip side of this efficiency gain is that managers have much more pub work. Strategizing about and reviewing pub drafts can create hours of work each week, especially for managers with larger teams or more junior team members. We also expect managers to execute and publish research independently, so they have to keep up with both their own pubs and those of their reports. Further, we’ve had a few instances in which managers must complete a pub for someone who’s left the company, which requires poring over old notes and code of varying quality, completing analyses, and sometimes even redoing work.
Perhaps unsurprisingly, we’ve noticed that most pubs that aren’t kicked off in a timely manner or that linger for months are those for which a manager is the point person. We don’t yet have data from reports about whether waiting on their manager to help with their pubs tends to bottleneck their progress, but we'll track this moving forward.
That said, in a quick poll of affected managers, most find it no more difficult to manage their pubs than before. All rated v2 “better” or “much better” than v1 at both a conceptual level and in practice. They also report an overall better publishing experience in the new model, consistent with company-wide data.
A final major difference between our v1 and v2 publishing models is our decision to put our scientists in charge of engaging with other researchers about their work. Why engage? We want: 1) other scientists to know about our research so they can learn from and build on it; 2) to get feedback from readers who catch critical errors or provide ideas, improving or speeding up our science; and 3) for readers to benefit from insightful comments from other readers. Rather than relying on pre-publication peer review, we’ve proposed that tracking how other scientists interact with our research will reliably indicate its usefulness and validity. To this end, the pub team previously coordinated engagement efforts for each pub, and at times, we tried having a team member dedicated entirely to facilitating interactions with the scientific community. In the v2 model, we’ve put this task entirely on our scientists. We hypothesized that we’d get more useful engagement with outside experts if we made it less like “homework” and our scientists felt the full responsibility and power of owning the process.
We’ve found that the impacts of our interactions with outside scientists are, in fact, unchanged. We send a handful of scientists to conferences every year, and our company recently held an “open house” event where we spoke more generally about our work with guests. Beyond that, scientist-driven engagement in our v2 system appears to be minimal.
While pageviews and repository interactions aren’t the most reliable ways to understand how others benefit from our work, we’ve seen early signs that our v2 pubs receive fewer views on average and less interaction (visits, clones, issues) with their associated GitHub repos. However, comparisons are difficult given the small number of pubs and GitHub repositories we've released in v2. We'll need to keep monitoring this as we release more pubs.
Tracking public comments on our pubs also suggests that outreach is infrequent or isn’t leading to public feedback. We do receive new comments much more frequently than before, but only because, in August 2024, we started requiring anyone applying to a scientific role to leave a comment as part of their evaluation. Before this, the median number of comments we received each month was four; now it’s 25! Almost all pub comments (~90%) now come from job applicants rather than readers who discovered our work organically or were made aware of it via outreach from our scientists.
Comment impact breakdown by publishing version.
When our scientists assign impact categories to public comments, they can choose multiple categories per comment if needed. Here, we split comments tagged with multiple categories, meaning we count each individual “impact” and calculate percentages based on the total number of impacts rather than total number of comments. “Commenter got an idea” denotes a perceived benefit to the commenter themself rather than to our scientists. “Other” is any impact that appeared fewer than 10 times across v1 and v2 categorizations.
The similarity between these distributions is an early suggestion that our shift to scientist-led engagement hasn't meaningfully changed how public feedback shapes our work.
Are the comments more or less helpful now? We have scientists rate the impact of comments they receive on their research (choosing from one or more options like “No impact,” “Changed our direction/idea we’re following up on,” and “Caught typo/small error”). Though we have fewer rated comments for our handful of v2 pubs so far, the resulting impact profiles are roughly consistent between v1 (n = 223 rated comments, 231 comment impacts) and v2 pubs (n = 28 rated comments, 29 comment impacts) (Figure 2).
In summary, with less outreach to external scientists, fewer people see and build on our work. We do get more comments because we require this of job applicants, but they’re not incredibly useful outside of the hiring process.
The data we analyzed and the code we used to do so are available on GitHub (DOI: 10.5281/zenodo.14969194).
We calculate workdays in progress by recording the “kickoff date” of the pub (when we held the kickoff meeting for v1 pubs and when the kickoff form was submitted in v2) and the publication or internal completion date. We use the WORKDAY_DIFF() function in Airtable to calculate an approximate number of workdays in progress. This doesn't account for individual PTO, company events, or other external factors beyond weekends. We used this script to calculate summary statistics related to pub duration.
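For anyone replicating this outside Airtable, the snippet below is a rough Python equivalent that excludes only weekends, just like WORKDAY_DIFF(). The column names are hypothetical and this isn't the linked script.

```python
import numpy as np
import pandas as pd

# Hypothetical columns; our real data comes from Airtable fields.
pubs = pd.DataFrame({
    "kickoff_date": ["2024-01-08", "2024-03-04"],
    "completion_date": ["2024-03-15", "2024-05-10"],
})

# np.busday_count skips weekends but not holidays or PTO, matching the
# approximation described above.
start = pd.to_datetime(pubs["kickoff_date"]).values.astype("datetime64[D]")
end = pd.to_datetime(pubs["completion_date"]).values.astype("datetime64[D]")
pubs["workdays_in_progress"] = np.busday_count(start, end)

print(pubs)
```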
To generate the donut charts and associated data in Figure 2, we used data stored in Airtable and this script (modified here to accept a CSV instead of using the Airtable API) to calculate the number of times our scientists assigned a given impact. We also used the Airtable summary bar to calculate how many comments were submitted by job applicants after our switch to this requirement. The pub’s point person assigned or approved all comment impacts.
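To illustrate the counting approach (again with hypothetical column names, not the exact linked script): multi-category comments get split into individual impacts, and percentages are computed over total impacts.

```python
import pandas as pd

# Hypothetical CSV export; our real data lives in Airtable.
comments = pd.DataFrame({
    "pub_version": ["v1", "v1", "v2"],
    "impacts": ["No impact", "Caught typo/small error; Commenter got an idea", "No impact"],
})

# Split comments tagged with multiple categories so each individual
# "impact" is counted once.
impacts = (
    comments.assign(impacts=comments["impacts"].str.split("; "))
    .explode("impacts")
)

# Percentages are relative to the total number of impacts, not comments.
breakdown = (
    impacts.groupby("pub_version")["impacts"]
    .value_counts(normalize=True)
    .mul(100)
    .round(1)
)
print(breakdown)
```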
We calculated a Flesch reading-ease score for each pub using scireadability (v0.6.4) in this script.
We used Grammarly Business to help clarify and streamline our text. We also used Claude (3.5 Sonnet) to suggest wording ideas and then choose which small phrases or sentence structure ideas to use, to write longer stretches of text that we edited, and to help write and debug code. We used Gemini 2.5 Pro to help write and clean up code for our September update to this pub. We also used GitHub Copilot to help clean up code.
We released the first version of this pub in December 2024. We’ve added this new section (and made some tweaks to the “Purpose” and “What’s next”) eight months later, after collecting more extensive data on our publishing v2 model. This update covers pubs released between December 15, 2024 and August 15, 2025. In that time, we released 18 new pubs, bringing the v2 total to 34. While we didn’t repeat all our initial analyses, we walk through some more fully informed conclusions in the following subsections.
TL;DR
There isn't a significant difference in speed between v1 and v2 pubs. But v2 pubs appear to be getting faster as time goes on.
Overall, the average time to publish for all published v2 pubs is 60.8 workdays (n = 34, SD = 42.6, 95% confidence interval [45.9, 75.7]). This is slightly slower than the v1 average of 56.4 workdays (n = 54, SD = 32.4, 95% CI [47.5, 65.2]), but the difference isn't significant (p = 0.606, Welch t-test).
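For transparency about the stats we quote here and below, this is a minimal sketch of the comparison: a t-based 95% confidence interval for each group's mean plus Welch's t-test, using scipy. The duration arrays are placeholders; the real per-pub values are in our GitHub repo.

```python
import numpy as np
from scipy import stats

def mean_ci(durations, confidence=0.95):
    """Mean workdays with a t-based confidence interval."""
    x = np.asarray(durations, dtype=float)
    sem = x.std(ddof=1) / np.sqrt(x.size)
    half_width = stats.t.ppf((1 + confidence) / 2, df=x.size - 1) * sem
    return x.mean(), x.mean() - half_width, x.mean() + half_width

# Placeholder durations (workdays per pub), not our real data.
v1_durations = np.array([40, 55, 62, 70, 35, 90, 48, 51])
v2_durations = np.array([30, 95, 120, 45, 25, 60, 80, 38])

print("v1:", mean_ci(v1_durations))
print("v2:", mean_ci(v2_durations))

# Welch's t-test (unequal variances), as reported in the text.
result = stats.ttest_ind(v1_durations, v2_durations, equal_var=False)
print("t =", round(result.statistic, 2), "p =", round(result.pvalue, 3))
```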
We’re encouraged by what we find when we look at the “initial” cohort of v2 pubs and the ones released since our original update. The initial 16 v2 pubs we originally reported on (some of which were in progress at the time and have since been published) took an average of 84.4 workdays to complete (SD = 41.4, 95% CI [62.4, 106.5]). The 18 “new” v2 pubs (kicked off and published since our last update) were completed in an average of 39.8 workdays (SD = 31.9, 95% CI [23.9, 55.7]). This represents a statistically significant improvement in publishing speed of about 44 workdays (p = 0.0016, Welch t-test). We’re still dealing with small sample sizes, but this is a very positive direction.
We believe there are a few reasons for this change. One is that we began publishing “notebook pubs” — findings shared directly as computational notebooks rather than sharing code and narrative split into separate artifacts [7]. While we haven’t released enough of them to say if they’re significantly faster as a rule, many of our quickest pubs are now notebook pubs.
Also, people are getting used to v2, and our tooling has evolved. We’ve chipped away at certain required steps, increased AI and automation throughout the process, and more of our scientists have published in v2 and gotten comfortable with the tools we have available.
These comparisons between the speed of v1 and v2 should be made with caution, as we need to grapple with a number of confounding variables that could increase or decrease the amount of time a pub takes — personal factors such as comfort with writing, company pivots, the requirements of a pub and its associated research artifacts, personnel changes, scheduling difficulties, and so many more. We don’t interpret these trends as solely being related to the difference between v1 and v2, but they're useful to track as we make decisions about our publishing experiment.
We’d also like to thank Kai Shanebeck for their comment on this pub regarding the confidence intervals for pub speed. We’ve added those stats here to help further clarify the speed differences between v1 and v2.
TL;DR
Scientist-driven engagement remains low in v2, and much of our direct engagement on pubs comes from job applicants. Imperfect “quality” measures are mostly unchanged, but v2 pubs show a different distribution in feedback form responses.
In our first update to v2, we reported on two metrics that can provide insight into the “quality” of pubs: Flesch reading ease (FRE) and responses to our feedback form at the bottom of each pub. The average FRE of v2 pubs (27.6; n = 34) is still roughly the same as v1 pubs (30.8; n = 54), and this difference isn't statistically significant (p = 0.072, Welch t-test). However, we do see some shifts in the feedback forms we receive.
We have a larger sample of responses now (n = 444; 267 for v1 pubs and 177 for v2 pubs). Note that all questions are optional, so n can differ between questions. We also treat responses as independent in these tests and interpret these as exploratory analyses, not definitive conclusions. We don’t see a significant change in how readers rate the ability to find everything needed to reuse/reproduce the work (n = 429, test, p = 0.284) or the evidence supporting claims (n = 356, test, p = 0.181). There are other shifts, though.
First, the overall “clarity” rating remains high for both v1 and v2, but the distribution of unclear responses has shifted somewhat. Readers of v2 pubs were less likely to label them as “Confusing” (3% vs. 6.6% for v1) and more likely to select “A little confusing” (13.2% vs. 6.6%) (n = 423, test, p = 0.025). That said, this isn't significant after multiple testing correction (Bonferroni p = 0.101).
Second, the overall distribution differs in the “usefulness” rating of v2 pubs. Proportionally, “Maybe” useful responses are higher (41% vs. 29.0% for v1) and “Not really” useful responses are lower (11.6% vs. 21.2%) (n = 432, test, Bonferroni-adjusted p = 0.024).
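As an illustration of this kind of categorical comparison, a chi-square test of independence on a version-by-response contingency table might look like the sketch below; the counts are made up purely for illustration, not our real response data.

```python
import numpy as np
from scipy.stats import chi2_contingency

# Made-up counts for illustration only; rows are v1/v2, columns are
# "Yes", "Maybe", and "Not really" responses to the usefulness question.
table = np.array([
    [120, 70, 50],  # v1 (hypothetical)
    [ 85, 75, 20],  # v2 (hypothetical)
])

chi2, p, dof, expected = chi2_contingency(table)
print(f"chi2 = {chi2:.2f}, p = {p:.4f}, dof = {dof}")
```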
Our main metric of engagement, the number of reader comments, is still higher in v2 than in v1, and about 90% of comments still come from job applicants. Other indicators of engagement, like GitHub repository visits and clones, are increasing, as are general pageviews, comments, and citations. In the first six months of v2, it appeared that there was some slowdown in GitHub visits and clones, but we now observe that v2 repositories generally receive a similar or higher number of clones per week; visits are more difficult to pin down. Rather than reflecting differences between our v1 and v2 processes, we believe these shifts are more likely related to an increase in traffic generally, greater awareness of our pubs, our requirement that job applicants leave a comment, and varying interest in our different GitHub repositories based on the nature of the content.
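We don't describe our collection pipeline in this pub, but for anyone curious how per-week clone counts can be pulled, GitHub's traffic API exposes them (for roughly the trailing two weeks only, so a sketch like this would need to run on a schedule). The repo name and token are placeholders.

```python
import requests

# Placeholder repo and token, not a real configuration.
OWNER, REPO, TOKEN = "Arcadia-Science", "example-repo", "ghp_xxx"

resp = requests.get(
    f"https://api.github.com/repos/{OWNER}/{REPO}/traffic/clones",
    headers={
        "Authorization": f"Bearer {TOKEN}",
        "Accept": "application/vnd.github+json",
    },
    params={"per": "week"},  # weekly buckets; GitHub retains ~14 days of traffic data
    timeout=30,
)
resp.raise_for_status()
data = resp.json()

for week in data["clones"]:
    print(week["timestamp"], "clones:", week["count"], "unique cloners:", week["uniques"])
```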
A note on comments from job applicants
While our requirement for applicants to comment on our pubs is somewhat unconventional, we've found it can generate high-quality engagement. Though many comments are indeed perfunctory and we do select for overly positive and uncritical responses, some include substantive critiques, detailed reanalyses, and thoughtful extensions of our work that provide genuine scientific value. This approach lets us leverage an existing process to sometimes capture useful feedback without additional effort, and we think it could be even more effective when supplemented with targeted outreach for a fuller picture of community response.
TL;DR
No v2 pubs have been delayed by commercialization decisions. We’ve also released all the v1 pubs we’d previously held back to consider for commercialization.
As in our original update, no v2 pubs have been delayed by commercialization decisions. While multiple pubs were “paused” by a scientist after kickoff, none of these delays have been related to commercialization. Additionally, all the pubs that we were holding internally have since been released publicly after discussions with the scientists who worked on them, supporting our hypothesis that overly cautious approaches can delay valuable community feedback without ultimately protecting IP. In a few cases, we found that earlier publication would have been beneficial not only for external feedback but also because the process of preparing work for publication itself routinely helps identify issues and strengthen research. Though changes in company direction have also contributed to this, we view increased clarity around IP as a continuing positive sign for publishing v2 and will continue monitoring.
You may notice discrepancies between the numbers we report in this update and our original publication. We’ve since improved our methodology for counting workdays for paused pubs, which caused small shifts in workday counts. We’ve also now released all the pubs we were holding internally — some of these pubs, which we counted as “complete” pubs in our first report, took additional time when authors revisited the content prior to release. Additionally, we decided not to publish one “internally complete” v1 pub from the initial dataset, but instead wrote a new pub with a different focus based on the same data using the v2 process. Further, we omitted two pubs that were included in our original analyses because they were kicked off during v1, work stalled, and then they were restarted and released during v2, thus going through both publishing processes.
All data for our September 2025 update (and a few additional analyses) are available in this folder of our GitHub repo, and all code is here.
All data from our original release of this pub are in this folder, and all code is here.
In a word, yes. We have a lot of ideas to improve, which we’ve omitted for space and because many are pretty specific to Arcadia’s inner workings. But even if all our attempts to improve fail completely, the benefits still outweigh the downsides (Figure 1), and we’re sure this new strategy is a move in the right direction. Scientist agency and model replicability are critical to the success of our experiment, and we’d strayed too far away from those guiding pillars. We may eventually devise yet another new model for publishing, but we won’t go back to our original approach.
That said, we’re hopeful that by implementing our improvement ideas, and perhaps by gathering constructive insights from you, our readers, we can get even more creative.
We hope you’ve found this pub interesting, but ultimately want you to take something useful away from these reflections. Even if you’re not exploring new models of research sharing, our experiment has surfaced some practical insights that might inform your publishing practices. Our high-level lessons are probably most useful for group leaders or people taking the lead on prepping a manuscript.
While giving authors full control over publishing can foster creativity, some structure helps maintain momentum:
By embracing the right technological solutions, you can reduce manual workload and free up your and your team's energy for higher-value work:
If you’re hoping the scientific community will help improve your work through discussion, there are a few ways to focus their attention:
We’ll keep tracking our publishing experiment and update you when we have new findings, ideas, recommendations, or need input. We’ll also let you know when we step into our next iteration, be it incremental or perhaps a full v3.
We’d love to hear what you think about our initial observations and ideas to improve our v2 publishing model. Are there other aspects of this strategy that we should measure to understand our overall success? What should we do to address the pub slowdown and lack of external engagement?
Roth R. (2025). scireadability. https://github.com/robert-roth/scireadability
Arcadia Science. (2024). arcadia-pycolor. https://github.com/Arcadia-Science/arcadia-pycolor
Arcadia Science. (2024). arcadiathemeR. https://github.com/Arcadia-Science/arcadiathemeR