LLM benchmarks, evals and tests: A mental model
31 Oct 2024
30 Oct 2024
The Common Vulnerabilities and Exposures (CVE) program, launched in late October 1999, has not only marked its presence but has become a pivotal force in shaping how we perceive and manage cybersecurity threats. A Journey Through Time The CVE program emerged as a beacon, standardizing how vulnerabilities are identified, shared, and mitigated. From its inception with just 321 entries, it…
Explore ICMP: the essential protocol for network diagnostics, error handling, and connectivity checks. A must-know for developers tackling networking!
Learn how to build a free, simple web-based chat app utilizing different technologies and leveraging building in-house or using an IaaS.
A factuality benchmark called SimpleQA that measures the ability for language models to answer short, fact-seeking questions.
29 Oct 2024
Enhance your data literacy with Data Quality Management. Learn how accurate and reliable data can drive informed decisions and insights
There's a new feature in macOS Sequoia that allows mirroring of your iPhone to your Mac. This is a nice feature that allows you to fully interact with your iPhone as an app on you Mac's desktop. When I upgrade to Sequoia and run the iPhone Mirroring app, it selected Victory, my old iPhone 14 Pro Max which is running…
Decagon and OpenAI deliver high-performance, fully automated customer support at scale
Driving public sector modernization: aligning technology, strategy, and citizen needs
Thoughtworks InsightsDriving public sector modernization: aligning technology, strategy, and citizen needs
28 Oct 2024
Maximize user satisfaction and retention with RUM, gain insights, reduce friction, and enhance experiences based on real-world interactions
Introduction Large language models are fantastic tools for unstructured text, but what if your text doesn’t fit in the context window? Bazaarvoice faced exactly this challenge when building our AI Review Summaries feature: millions of user reviews simply won’t fit into the context window of even newer LLMs and, even if they did, it would […]
See what improvements we’ve made this autumn
There's a common narrative that Microsoft was moribund under Steve Ballmer and then later saved by the miraculous leadership of Satya Nadella. This is the dominant narrative in every online discussion about the topic I've seen and it's a commonly expressed belief "in real life" as well. While I don't have anything negative to say about Nadella's leadership in this…
Cypress: Should we apply the page object model to optimize the framework?
Connect, learn and transform through technology
27 Oct 2024
I’ve been having problems for the last 3 years or so where Mess With DNS periodically runs out of memory and gets OOM killed. This hasn’t been a big priority for me: usually it just goes down for a few minutes while it restarts, and it only happens once a day at most, so I’ve just been ignoring. But last…
This post is an edited and expanded version of a now-mangled Mastodon thread. Contents Platforms Are Competitions ...and We're Losing Win Condition You Do It To Yourself, And That's What Really Hurts Groundhog Day Reboot Some in the JavaScript community imagine that I harbour an irrational dislike of their tools when, in fact, I want nothing more than to stop…
25 Oct 2024
Introduction In this article, we are going to see how we can use the LazyConnectionDataSourceProxy with Spring Data JPA to acquire the database connection as late as possible and, therefore, reduce transaction response time. For an introduction to how Spring transactions manage database connections, check out this article as well. Service Layer Connection Management Let’s consider we have the following…
He holds out a rubber band, expands it until it snaps and says, “Tension seeking resolution. That’s it. That’s the key. The fundamental…
Hear from our customers across the globe
Bringing generative AI to bear on legacy modernization in insurance
Central bank digital currencies: What’s really in them for banks?
24 Oct 2024
Phuong Van explores Phoenix LiveView implementation, covering data migration, UI development, and team collaboration from concept to production. The post Implementing Phoenix LiveView: From Concept to Production appeared first on Erlang Solutions.
Safeguarding the future of frictionless banking against fraud
23 Oct 2024
Unlock data insights to make smarter decisions! Discover how to extract valuable trends, patterns, and actionable strategies from your data
Unlock the power of real-world data with a comprehensive dive into Data Acquisition (DAQ) systems, essential for IoT, automation, and research
New data explores how and when open banking payments will hit mainstream status
We’ve simplified, stabilized, and scaled continuous-time consistency models, achieving comparable sample quality to leading diffusion models, while using only two sampling steps.
As cyber risk grows and AI reshapes the threat landscape, security leaders are being asked to do more than protect systems: They must also support the business. In this episode of the Executive Guidance podcast, Brad Moldenhauer, VP and CISO in Residence at leading cloud security company Zscaler, joins Zohra Ibrahimi, Toptal’s Cyber and Information Security Practice Lead, to discuss…
As AI transforms how buyers discover and evaluate solutions, traditional SEO playbooks are being rewritten. In this episode of the Executive Guidance podcast, Kevin White, Head of Marketing at Scrunch, joins Jeff Gangemi to explore how LLMs are reshaping intent, discovery, and conversion, and what marketers should do next.
CentralSquare’s Chief Product Officer, Brian Fugere, is helping public safety agencies and local governments evolve. In this episode of the Executive Guidance podcast, Brian shares how his team is building cloud-native, AI-powered platforms to support emergency response, administration, and citizen services.
Koley Corte, MetLife SVP and CMO of Regional and Institutional Marketing, Global Customer Insights, leads one of the most expansive marketing organizations in financial services. In this episode of the Executive Guidance podcast, she shares how she aligns global teams around customer insight, strategic execution, and purpose-driven leadership.
Avid’s Chief Marketing Officer, Angela Stelle, shares how she’s reshaping the company’s brand, culture, and creative engine for the future. In this episode of the Executive Guidance podcast, Angela reflects on lessons from her second CMO role, the power of executive presence, and how to lead with both rigor and empathy.
Lumivero CMO Tessa Barron joins the Executive Guidance podcast to share her unique approach to growth-driven marketing. Discover how Tessa builds scalable marketing functions grounded in strategic planning, audience ownership, and a deep understanding of business outcomes.
Learn how Jason Smith, AI Strategy Lead at Publicis Media, is helping clients navigate the fast-changing world of AI in media and advertising. In this episode of the Executive Guidance podcast, Smith shares practical insights on applying AI to audience targeting, creative automation, and responsible governance.
Discover how Emily Jackson, SVP of eCommerce at Forbes, is shaping the next era of digital publishing. In this episode of the *Executive Guidance* podcast, she shares insights on how commerce content became a key revenue driver, the evolving role of AI in product discovery, and how Forbes balances editorial trust with business growth.
Discover how Ricoh’s Vice President of Portfolio Software Solution Development, Ashok Shenoy, is leading AI-driven transformation by focusing on people, process, and data. In this episode of the Executive Guidance podcast, Shenoy shares how Ricoh built an AI-ready culture, overcame adoption challenges, and scaled AI to improve business operations.
Macro trends in the tech industry | October 2024
Reckoning with generative AI’s uncanny valley
Humanizing your data strategy: Seven key ideas for the AI era
22 Oct 2024
It’s entirely possible that I’ve posted something on this topic before. I know I’ve posted about it on social media before. Every so often – thankfully not too often – I see a post on Stack Overflow containing something like this: “This looks like a bug in VS.NET” “I’m 100% sure my code is correct” … Continue reading No, the…
Finally, several posts in, I’m actually going to start talking about abstraction using DigiMixer as the core example. When I started writing DigiMixer (almost exactly two years ago) I didn’t expect to take so long to get to this point. Even now, I’m not expecting this post to cover “everything about abstraction” or even “all … Continue reading Abstraction: Introduction…
Node.js 23 is here, and it’s bringing some exciting changes and improvements to the runtime. With this release, Node.js 23 takes over as the ‘Current’ version, replacing Node.js 22, which is moving into Long-Term Support (LTS) later this month. What’s New in Node.js 23 Default ESM in require() One of the biggest updates in Node.js […] The post Node.js 23:…
I recently installed Python 3.13.0 rc2 to test rst2pdf against it and found that I couldn't install PyMyPDF which is required for the tests. $ pip install pymupdf ... This is because for a pre-release version, binary wheels are not provided to PyPI for the mupdf dependency which is written in C++. Hence, the compiler needs the headers. On my…
OpenAI and the Lenfest Institute AI Collaborative and Fellowship program
Unlocking the future of health: Revolutionizing patient data management
21 Oct 2024
Uncover hidden patterns, spot anomalies, and validate assumptions with powerful exploratory data analysis and stunning visuals—before modeling!
Discover how anomaly detection identifies unusual patterns in data, enhancing security, risk management, and quality across various industries.
Putting MLOps into practice effectively with the help of Databricks
19 Oct 2024
Is it time for a career change? Use the 3P formula (Paisa, Power, Position) to evaluate your job and unlock your career growth potential.
Recently I needed to add some keyboard shortcuts to an app I was working on. I wrote up some example code and decided to write this article…
17 Oct 2024
Heroku is a powerful general-purpose PaaS offering, but when combined with the broader Salesforce portfolio, it excels in unlocking and unifying customer data, regardless of its age, location, size, or structure. Salesforce customers turn to Heroku when they need to leverage high data volumes from sources such as consumer web or mobile apps or when […] The post How to…
Learn how Erlang Solutions helped companies like TV4, FACEIT, and BET Software overcome tech challenges and achieve success. The post Client Case Studies with Erlang Solutions appeared first on Erlang Solutions.
16 Oct 2024
Discover the latest release of Llamafile 0.8.14, an open-source AI tool by Mozilla Builders. With a new command-line chat interface, enhanced performance, and support for powerful models, Llamafile makes it easy to run large language models (LLMs) on your own hardware. Learn more about the updates and how to get involved with this cutting-edge project. The post Llamafile v0.8.14: a…
D2C is not an eCommerce issue, it's corporate finance modernization
Driving innovation at Thoughtworks: The Power of Hackathons
15 Oct 2024
Kristin Lauter and her colleagues at Facebook research recently announced a project to benchmark attacks against LWE. The announcement was on the post-quanum crypto mailing list. They state: “Our approach is motivated by the need to study more carefully the effect on security of using small secrets and small error in standardized LWE settings like Kyber and Homomorphic Encryption. In…
In our last blog about our Quality Assurance (QA) team, we gave an overview of the QA process, including our software and testing methods. One of our key tools during testing is the Materialize Emulator, a Docker image that allows you to maintain a locally hosted version of Materialize. Read the rest of the blog post over on the Materialize…
We've analyzed how ChatGPT responds to users based on their name, using AI research assistants to protect privacy.
Last week, Dave Winer posted that Scripting News turned 30. That's an amazing milestone and Dave's still writing daily. 1994 seems like quite a while ago; the web was only 3 years old! I started blogging in 2003 on a personal domain and added this blog in 2005. I don't write daily though and I tip my hat to Dave.…
What a year in the trenches building with LLMs taught me about delivering value with AI Artificial intelligence (AI) is rapidly becoming an integral part of modern engineering. From automating mundane tasks to driving groundbreaking innovations, AI offers unprecedented opportunities for organizations to gain a competitive edge. I’ve spent over a year in the trenches building with LLMs and even…
14 Oct 2024
Introduction In this article, we are going to analyze how the RESOURCE_LOCAL JPA transaction type works. Since this is the default transaction type when using Spring Boot or Spring Data JPA, it’s very important to understand how transactions are managed when using the RESOURCE_LOCAL mode. JPA Transaction Types When the JPA 1.0 specification was released, there were two transaction types…
13 Oct 2024
Master mobile push notifications to boost app engagement, retention, and conversions. key strategies and technical skills to enhance your app
11 Oct 2024
Continuity Camera is the macOS feature that allows you to use an iPhone as a webcam.
10 Oct 2024
We’ve been working to bring components of Quip’s technology into Slack with the canvas feature, while also maintaining the stand-alone Quip product. Quip’s backend, which powers both Quip and canvas, is written in Python. This is the story of a tricky bug we encountered last July and the lessons we learned along the way about…
We introduce MLE-bench, a benchmark for measuring how well AI agents perform at machine learning engineering.
Fintech open source is transforming the industry, offering flexible, scalable, and cost-effective solutions for businesses looking to innovate and stay competitive. The post Why Open Source Technology is a Smart Choice for Fintech Businesses appeared first on Erlang Solutions.
Two minutes for mental health
9 Oct 2024
If your cloud application performs poorly or is unreliable, users will walk away, and your enterprise will suffer. To know what’s going on inside of your million-concurrent-user application (Don’t worry, you’ll get there!), you need observability. Observability gives you the insights you need to understand how your application behaves. As your application and architecture scale […] The post Best Practices…
I’ve been using Gumroad for payments of GoReleaser Pro since the beginning, around May 2021.
Turns out, creating an HTML table that has both horizontal scroll and fixed headers can be a tricky problem. As another developer said about…
8 Oct 2024
Hearst’s iconic brands bring curated lifestyle and local news content to OpenAI’s products.
In my Makefile, I check for OpenAPI spec issues with this command: docker run --rm -it -v $(PWD):/tmp stoplight/spectral lint \ --ruleset /tmp/spec/.spectral.yaml /tmp/spec/openapi.yaml When running in GitLab CI, we set the image to stoplight/spectral:latest, and override the entry point so that we can run spectral directly: openapi-lint: stage: test image: name: stoplight/spectral:latest entrypoint: [""] script: - spectral lint --ruleset…
New agreement includes GoCardless’ ‘Same Day Settlement’ feature to help savers reach their financial goals more quickly
7 Oct 2024
TCP or UDP? Dive into the key differences, performance trade-offs, and when to choose each for your app's networking needs in this developer guide!
The fintech industry has exploded over the past few years, transforming how we (and financial institutions) manage, move, and invest money.
Introduction Welcome to a new issue of the High-Performance Java Persistence Newsletter in which we share articles, videos, workshops, and StackOverflow answers that are very relevant to any developer who interacts with a database system using Java. Articles The pick of this week is this article about B+Tree indexes, which features a live tool that we can use to see…
An introduction to the AHA Stack: Astro, HTMX, Alpine.js, and Supabase.
Cultivating thought leaders
6 Oct 2024
Need to expose your local dev server? Discover secure, flexible tools like Localtunnel, Cloudflare Tunnel, and frp for easy remote access and testing!
4 Oct 2024
The October 2024 release of the Python and Jupyter extensions for Visual Studio Code are now available. This month's updates include Python test coverage support and a default Python problem matcher. Keep reading to learn more! The post Python in Visual Studio Code – October 2024 Release appeared first on Microsoft for Python Developers Blog.
Digital systems have become deeply integrated into many aspects of modern life, particularly within the financial sector. While digital banking simplifies day-to-day operations for clients, it also creates new opportunities for malicious actors to exploit these systems. As a result, money laundering has grown particularly prevalent due to this digital expansion. Banks are required to monitor for money laundering activities…
Development of Blender 4.3 has reached the Beta stage. Please test and report any issues you may find!
Like most places, Jane Street largely teaches developers through a kind of apprenticeship model. A team matching process tries to thoughtfully match new devs to a team that suits them; and from there carefully chosen projects, one-on-one mentorship, code review, and close collaboration with people “on the row” – teammates sitting near you – does most of the rest.
3 Oct 2024
Explore the NonStop architecture's influence on Elixir, Gleam, and Erlang developers. Learn about modularity, fault containment, and process-pairs design for resilient software systems. The post Why do systems fail? Tandem NonStop system and fault tolerance appeared first on Erlang Solutions.
Introducing canvas
In addition to securing $6.6 billion in new funding from leading investors, we have established a new $4 billion credit facility with leading banks, including JPMorgan Chase, Citi, Goldman Sachs, Morgan Stanley, Santander, Wells Fargo, SMBC, UBS, and HSBC.
Why data will define value for insurers - and how to harness its full potential for success
Thoughtworks InsightsWhy data will define value for insurers - and how to harness its full potential for success
2 Oct 2024
We are making progress on our mission to ensure that artificial general intelligence benefits all of humanity.
Discover the power of balanced leadership - blending comfort and competition to drive team success and avoid burnout.
The new version of our vector data editor makes creating and publishing your own vector geospatial data easier than ever.
1 Oct 2024
The Two Billion Claps Bug TL;DR A user was able to exploit a race condition in our backend system to manipulate clap counts on posts. Users are supposed to only be able to clap between 0 and 50 times for a given post, but this hack allowed them to go outside those bounds (both above and below). Our fix leverages…
Developers can now build fast speech-to-speech experiences into their applications
Developers can now fine-tune GPT-4o with images and text to improve vision capabilities
Offering automatic discounts on inputs that the model has recently seen
Fine-tune a cost-efficient model with the outputs of a large frontier model–all on the OpenAI platform
Sometimes you need some additional DNS entries in your containers. This is how to do it in compose.yaml. Internal entries Within the containers, the name of the container in compose.yaml is resolvable via DNS. Given this compose.yaml: services: web: # ... app: # ... networks: myapp: driver: "bridge" We can ping web from within the app container. If we need…
Altera uses GPT-4o to build a new area of human collaboration