character-encode

发表于2024-12-06|更新于2026-06-10|Java

|浏览量:

Text Encode & Decode

ASCII

ANSI(美国国家标准学会) 推出了 ASCII

ASCII 包括 0-9 a-z A-Z !”#$%… 控制字符

码点 code point

每个字符对应的数字

ASCII: 0-127(2^7)

字符集 charset

标准所支持的所有字符及其对应码点的集合，称之为字符集

编码 encode

从字符到计算机存储的的二进制的过程，称之为编码
字符编码规则不一定会把码点直接转换成二进制存储在计算机中

在 ASCII 和 UTF-32 中，会把码点直接转换为二进制

Unicode

囊括了各国文字、Emoji、象形文字的字符集

UTF-32

每个字符使用 4 字节存储，不够 32 bit 的向前补零

UTF-8

Unicode 的可变长度编码，码点越大，编码后的二进制越长

乱码的原因

编码规则与解码规则不同
部分编辑器将 Unicode 中无法识别或展示的字符自动替换为特殊符号，在保存文件时，将 EF BF BD 写入
- Unicode 字符集中有一个特殊的替换符号，专门用于表示无法识别或展示的字符

文章作者: xhj

文章链接: https://hzhzxfs.github.io/2024/12/06/character-encode/

版权声明: 本博客所有文章除特别声明外，均采用 CC BY-NC-SA 4.0 许可协议。转载请注明来源 xhj的博客！

相关推荐

springmvc-notes

SpringMVC笔记SpringMVC简介Spring 为展现层提供的基于 MVC 设计理念的优秀的 Web 框架，是目前最主流的 MVC 框架之一 SpringMVC属于Spring框架中的web模块 HelloWorld流程:1 导包 1234567891011121314日志记录commons-logging-1.1.3.jar支持注解spring-aop-4.0.0.RELEASE.jarSpring核心容器模块spring-beans-4.0.0.RELEASE.jarspring-context-4.0.0.RELEASE.jarspring-core-4.0.0.RELEASE.jarspring-expression-4.0.0.RELEASE.jarSpring Web模块spring-web-4.0.0.RELEASE.jarspring-webmvc-4.0.0.RELEASE.jar 2 写配置配置springmvc的前端控制器，指定springmvc配置文件位置 WEB-INF/web.xml: 12345678910111213141...

git笔记git的基本操作config的3个作用域local:只对当前仓库有效global:对登录用户所有仓库有效system:对系统的所有用户有效 3个作用域的优先级： 12345678git config --list --local # 查看版本库范围的所有设置(若不指定local，则显示所有范围的设置)git config --list --global # 查看global范围的设置参数git config --list --system # 查看system范围的设置git config --list user.name # 只显示user.name的值git config --local user.name 'username' # 修改local(版本库)范围的user.name(若不加变量作用域，则默认为local)git config --global use...

Tomcat NoteServlet 容器和 Spring/SpringMVC 容器之间的关系Tomcat&Jetty 在启动时给每个 Web 应用创建一个全局的上下文环境，这个上下文就是 ServletContext，其为后面的 Spring 容器提供宿主环境。 Tomcat&Jetty 在启动过程中触发容器初始化事件，Spring 的 ContextLoaderListener 会监听到这个事件，它的 contextInitialized 方法会被调用，在这个方法中，Spring 会初始化全局的 Spring 根容器，这个就是 Spring 的 IoC 容器，IoC 容器初始化完毕后，Spring 将其存储到 ServletContext 中，便于以后来获取。 Tomcat&Jetty 在启动过程中还会扫描 Servlet，一个 Web 应用中的 Servlet 可以有多个，以 SpringMVC 中的 DispatcherServlet 为例，这个 Servlet 实际上是一个标准的前端控制器，用以转发、匹配、处理每个 Servlet 请求...

为什么动态代理对象proxy的System.out.println(proxy)与System.out.println(proxy.getClass)的输出结果不同

为什么动态代理对象proxy的System.out.println(proxy)与System.out.println(proxy.getClass)的输出结果不同起因在学习Spring的AOP面向切面编程时,有这么一个例子 Calculator接口 123456public interface Calculator { int add(int a, int b); int sub(int a, int b); int mul(int a, int b); int div(int a, int b);} Calculator的实现类 123456789101112131415@Servicepublic class CalculatorImpl implements Calculator { public int add(int a, int b) { return a + b; } public int sub(int a, int b) { re...

JDBC驱动类加载1Class.forName("com.mysql.cj.jdbc.Driver"); Connection与特定数据库的连接（会话）。执行SQL语句并在连接的上下文中返回结果。Connection对象的数据库能够提供描述其表，其支持的SQL语法，其存储过程，此连接的功能等的信息。 1Connection connection = DriverManager.getConnection("jdbc:mysql://localhost:3306/db4", "deltav", "testpass"); Statement接口用于向数据库提交SQL语句 Statement123Statement statement = connection.createStatement();String sql = "SQL_STATEMENT";statement.execute(sql); 使用addBatch(SQL)批量添加数据到Statement对象中，使用exe...

HTML NotesChore常用HTML标签的英文全称及简单描述 HTML标签英文全称中文释义 a Anchor 锚 abbr Abbreviation 缩写词 acronym Acronym 取首字母的缩写词 address Address 地址 alt alter 替用(一般是图片显示不出的提示) b Bold 粗体（文本） bdo Direction of Text Display 文本显示方向 big Big 变大（文本） blockquote Block Quotation 区块引用语 br Break 换行 cell cell 巢 cellpadding cellpadding 巢补白 cellspacing cellspacing 巢空间 center Centered 居中（文本） cite Citation 引用 code Code 源代码（文本） dd Definition Description 定义描述 del Deleted 删除（的文本） dfn Defines a...